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NUCLEIC ACID SEQUENCE ENCODING BETA-C-4-OXYGENASE FROM HAEMATOCOCCUS PLUVIALIS FOR THE 
BIOSYNTHESIS OF ASTAXANTHIN 



FTRT.D AND BAC KGROU N D OF THE INVENTION 

The present invention relates, in general, to a biotechnologicai method for 
production of (3S,yS) astaxanthin. In particular, the present invention relates to a 

10 peptide having a p-C-4-oxygenase activity; a DNA segment coding for this 
peptide; an RNA segments coding for this peptide; a recombinant DNA molecule 
comprising a vector and the DNA segment; a host cell or organism containing the 
above described recombinant DNA molecule or DNA segment; and to a method of 
biotechnologically producing (3*^,3 '5^ astaxanthin or a food additive containing 

15 (3S,yS) astaxanthin, using the host. 

Carotenoids, such as astaxanthin, are natural pigments that are responsible 
for many of the yellow, orange and red colors seen in living organisms. 
Carotenoids are widely distributed in nature and have, in various living systems, 
two main biological functions: they serve as light-harvesting pigments in 

20 photosynthesis, and they protect against photooxidative damage. These and 
additional biological functions of carotenoids, their important industrial role, and 
their biosynthesis are discussed hereinbelow. 

As part of the light-harvesting antenna, carotenoids can absorb photons and 
transfer the energy to chlorophyll, thus assisting in the harvesting of light in the 

25 range of 450 - 570 nm [see, Cogdell RJ and Frank HA (1987) How carotenoids 
function in photosynthestic bacteria. Biochim Biophys Acta 895: 63-79; Cogdell R 
(1988) The function of pigments in chloroplasts. In: Goodwin TW (ed) Plant 
Pigments, pp 183-255. Academic Press, London; Frank HA, Violette CA, 
Trautman JK, Shreve AP, Owens TG and Albrecht AC (1991) Carotenoids in 

30 photosynthesis: structure and photochemistry. Pure Appl Chem 63: 109-1 14; Frank 
HA, Farhoosh R, Decoster B and Christensen RL (1992) Molecular features that 
control the efficiency of carotenoid-to-chlorophyll energy transfer in 
photosynthesis. In: Murata N (ed) Research in Photosynthesis, Vol I, pp 125-128. 
Kluwer, Dordrecht; and, Cogdell RJ and Gardiner AT (1993) Functions of 

35 carotenoids in photosynthesis. Meth Enzymol 214: 185-193]. Although 
carotenoids are integral constituents of the protein-pigment complexes of the light- 
harvesting antennae in photosynthetic organisms, they are also important 
components of the photosynthetic reaction centers. 
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Most of the total carotenoids is located in the light harvesting complex II 
[Bassi R, Pineaw B, Dainese P and Marquartt J (1993) Carotenoid binding proteins 
of photosystem II. Eur J Biochem 212: 297-302]. The identities of the 
photosynthetically active carotenoproteins and their precise location in light- 
harvesting systems are not known. Carotenoids in photochemically active 
chlorophyll-protein complexes of the thermophilic cyanobacterium Synechococcus 
sp. were investigated by linear dichroism spectroscopy of oriented samples [see, 
Breton J and Kato S (1987) Orientation of the pigments in photosystem II: low- 
temperature linear-dichroism study of a core particle and of its chlorophyll-protein 
subunits isolated from Synechococcus sp. Biochim Biophys Acta 892: 99-107]. 
These complexes contained mainly a p-carotene pool absorbing around 505 and 
470 nm, which is oriented close to the membrane plane. In photochemically 
inactive chlorophyll-protein complexes, the P-carotene absorbs around 495 and 
465 nm, and the molecules are oriented perpendicular to the membrane plane. 

Evidence that carotenoids are associated with cyanobacterial photosystem 
(PS) II has been described [see, Suzuki R and Fujita Y (1977) Carotenoid 
photobleaching induced by the action of photosynthetic reaction center II: DCMU 
sensitivity. Plant Cell Physiol 18: 625-631; and, Newman PJ and Sherman LA 
(1978) Isolation and characterization of photosystem I and II membrane particles 
from the blue-green alga Synechococcus cedrorum, Biochim Biophys Acta 503: 
343-361]. There are two p-carotene molecules in the reaction center core of PS II 
[see, Ohno T, Satoh K and Katoh S (1986) Chemical composition of purified 
oxygen-evolving complexes from the thermophilic cyanobacterium Synechococcus 
sp. Biochim Biophys Acta 852: 1-8; Gounaris K, Chapman DJ and Barber J (1989) 
Isolation and characterization of a Dl/D2/cy to chrome b-559 complex from 
Synechocystis PCC6803. Biochim Biophys Acta 973: 296-301; and, Newell RW, 
van Amerongen H, Barber J and van Grondelle R (1993) Spectroscopic 
cheiracterization of the reaction center of photosystem II using polarized light: 
Evidence for (5-carotene excitors in PS II reaction centers. Biochim Biophys Acta 
1057: 232-238] whose exact function(s) is still obscure [reviewed by Satoh K 
(1992) Structure and function of PS II reaction center. In: Murata N (ed) Research 
in Photosynthesis, Vol. II, pp. 3-12. Kluwer, Dordrecht]. It was demonstrated that 
these two coupled P-carotene molecules protect chlorophyll P680 from 
photodamage in isolated PS II reaction centers [see, De Las Rivas J, Telfer A and 
Barber J (1993) 2-coupled P-carotene molecules protect P680 from photodamage 
in isolated PS II reaction centers. Biochim. Biophys. Acta 1 142: 155-164], and this 
may be related to the protection against degradation of the Dl subunit of PS II 
[see, Sandmann G (1993) Genes and enzymes involved in the desaturation 
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reaction? from phytoene to lycopene. (abstract), 10th International Symposium on 
Carotenoids, Trondheim CLl-2]. The light-harvesting pigments of a highly 
purified, oxygen-evolving PS II complex of the thermophilic cyanobacterium 
Synechococcus sp. consists of 50 chlorophyll a and 7 P-carotene, but no 
xanthophylU molecules [see, Ohno T, Satoh K and Katoh S (1986) Chemical 
composition of purified oxygen-evolving complexes from the thermophilic 
cyanobacterium Synechococcus sp. Biochim Biophys Acta 852: 1-8]. P-carotene 
was shown to play a role in the assembly of an active PS II in green algae [see, 
Humbeck K, Romer S and Senger H (1989) Evidence for the essential role of 
carotenoids in the assembly of an active PS IL Planta 179: 242-250], 

Isolated complexes of PS I from Phormidium luridum, which contained 40 
chlorophylls per P700, contained an average of 1.3 molecules of (3-carotene [see, 
Thornber JP, Alberte RS, Hunter FA, Shiozavva JA and Kan KS (1976) The 
organization of chlorophyll in the plant photosynthetic unit, Brookhaven Symp 
Biology 28: 132-148]. In a preparation of PS I particles from Synechococcus sp. 
strain PCC 6301, which contained 130 ± 5 molecules of antenna chlorophylls per 
P700, 16 molecules of carotenoids were detected [see, Lundell DJ, Glazer AN, 
Melis A and Malkin R (1985) Characterization of a cyanobacterial photosystem I 
complex, J Biol Chem 260: 646-654]. A substantial content of p-carotene and the 
xanthophylls cryptoxanthin and isocryptoxanthin were detected in PS I pigment- 
protein complexes of the thermophilic cyanobacterium Synechococcus elongatus 
[see, Coufal J, Hladik J and Sofrova D (1989) The carotenoid content of 
photosystem 1 pigment-protein complexes of the cyanobacterium Synechococcus 
elongatus. Photosynthetica 23: 603-616]. A subunit protein-complex structure of 
PS I from the thermophilic cyanobacterium Synechococcus sp., which consisted of 
four polypeptides (of 62, 60, 14 and 10 kDa), contained approximately 10 (3- 
carotene molecules per P700 [see, Takahashi Y, Hirota K and Katoh S (1985) 
Multiple forms of P700-chlorophyll a-protein complexes from Synechococcus sp.: 
the iron, quinone and carotenoid contents. Photosynth Res 6: 183-192], This 
carotenoid is exclusively bound to the large polypeptides which carry the 
functional and anteima chlorophyll a. The fluorescence excitation spectrum of 
these complexes suggested that p-carotene serves as an efficient antenna for PS I. 

As mentioned, an additional essential function of carotenoids is to protect 
against photooxidation processes in the photosynthetic apparatus that are caused 
by the excited triplet state of chlorophyll. Carotenoid molecules with 7c-electron 
conjugation of nine or more carbon-carbon double bonds can absorb triplet-state 
energy from chlorophyll and thus prevent the formation of harmful singlet-state 
oxygen radicals. In Synechococcus sp. the triplet state of carotenoids was 
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monitored in closed PS II centers and its rise kinetics of approximately 25 
nanoseconds is attributed to energy transfer from chlorophyll triplets in the antenna 
[see, Schlodder E and Brettel K (1988) Primary charge separation in closed 
photosystem II with a lifetime of 1 1 nanoseconds. Flash-absorption spectroscopy 

5 with oxygen-evolving photosystem II complexes from Synechococcus. Biochim 
Biophys Acta 933: 22-34]. It is conceivable that this process, that has a lower 
yield compared to the yield of radical-pair formation, plays a role in protecting 
chlorophyll from damage due to over-excitation. 

The protective role of carotenoids in vivo has been elucidated through the 

10 use of bleaching herbicides such as norflurazon that inhibit carotenoid biosynthesis 
in all organisms performing oxygenic photosynthesis [reviewed by Sandmann G 
and Boger P (1989) Inhibition of carotenoid biosynthesis by herbicides. In: Boger 
P and Sandmann G (Eds.) Target Sites of Herbicide Action, pp 25-44. CRC Press, 
Boca Raton, Florida]- Treatment with norflurazon in the light results in a decrease 

15 of both carotenoid and chlorophyll levels, while in the dark, chlorophyll levels are 
unaffected. Inhibition of photosynthetic efficiency in cells of Oscillatoria agardhii 
that were treated with the pyridinone herbicide, fluridone, was attributed to a 
deci-ease in the relative abundance of myxoxanthophyll, zeaxanthin and P- 
carotene, which in turn caused photooxidation . of chlorophyll molecules [see, 

20 Canto de Loura I, Dubacq JP and Thomas JC (1987) The effects of nitrogen 
deficiency on pigments and lipids of cianobacteria. Plant Physiol 83: 838-843], 

It has been demonstrated in plants that zeaxanthin is required to dissipate, in 
a nonradiative manner, the excess excitation energy of the antenna chlorophyll 
[see, Pemmig- Adams B (1990) Carotenoids and photoprotection in plants: a role 

25 for the xanthophyll zeaxanthin. Biochim Biophys Acta 1020: 1-24; and, Demmig- 
Adams B and Adams WW III (1990) The carotenoid zeaxanthin and high-energy- 
state quenching of chlorophyll fluorescence. Photosynth Res 25: 187-197]. In 
algae and plants a light-induced deepoxidation of violaxanthin to yield zeaxanthin, 
is related to photoprotection processes [reviewed by Demmig-Adanls B and 

30 Adams WW III (1992) Photoprotection and other responses of plants to high light 
stress. Ann Rev Plant Physiol Plant Mol Biol 43: 599-626]. The light-induced 
deepoxidation of violaxanthin and the reverse reaction that takes place in the dark, 
are known as the "xanthophyll cycle" [see, Demmig- Adams B and Adams WW III 
(1992) Photoprotection and other responses of plants to high light stress. Ann Rev 

35 Plant Physiol Plant Mol Biol 43; 599-626]. Cyanobacterial Hchens, that do not 
contain any zeaxanthin and that probably are incapable of radiationless energy 
dissipation, are . sensitive to high light intensity; algal lichens that contain 
zeaxanthin are more resistant to high-light stress [see, Demmig-Adams B, Adams 
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WW III, Green TGA, Czygan FC and Lange OL (1990) Differences in the 
susceptibility to light stress in two hchens forming a phycosymbiodeme, one 
partner possessing and one lacking the xanthophyll cycle. Oecologia 84: 451-456: 
Demmig- Adams B and Adams WW III (1993) The xanthophyll cycle, protein 

5 turnover, and the high light tolerance of sun-acclimated leaves. Plant Physiol 103: 
1413-1420; and, Demmig-Adams B (1990) Carotenoids and photoprotection in 
plants: a role for the xanthophyll zeaxanthin. Biochim Biophys Acta 1020: 1-24]. 
In contrast to algae and plants, cyanobacteria do not have a xanthophyll cycle. 
However, they do contain ample quantities of zeaxanthin and other xanthophylls 

10 that can support photoprotection of chlorophyll. 

Several other functions have been ascribed to carotenoids. The possibility 
that carotenoids protect against damaging species generated by near ultra-violet 
(UV) irradiation is suggested by results describing the accumulation of p-carotene 
in a UV-resistant mutant of the cyanobacterium Gloeocapsa alpicola [see, Buckley 

15 CE and Houghton J A (1976) A study of the effects of near UV radiation on the 
pigmentation of the blue-green alga Gloeocapsa alpicola. Arch Microbiol 107: 93- 
97]. This has been demonstrated more elegantly in Escherichia coli cells that 
produce carotenoids [see, Tuveson RW and Sandmann G (1993) Protection by 
cloned carotenoid genes expressed in Escherichia coli against phototoxic 

20 molecules activated by near-ultraviolet light. Meth Enzymol 214: 323-330]. Due 
to their ability to quench oxygen radical species, carotenoids are efficient anti- 
oxidants and thereby protect cells from oxidative damage. This function of 
carotenoids is important in virtually all organisms [see, Krinsky NI (1989) 
Antioxidant functions of carotenoids* Free Radical Biol Med 7: 617-635; and, 

25 Palozza P and Krinsky NI (1992) Antioxidant effects of carotenoids in vivo and in 
vitro - an overview. Meth Enzymol 213: 403-420]. Other cellular functions could 
be affected by carotenoids, even if indirectly. Although carotenoids in 
cyanobacteria are not the major photoreceptors for phototaxis, an influence of 
carotenoids on phototactic reactions, that have been observed in Anabaena 

30 variabilis^ was attributed to the removal of singlet oxygen radicals that may act as , 
signal intermediates in this system [see, Nultsch W and Schuchart H (1985) A 
model of the phototactic reaction chain of cyanobacterium Anabaena variabilis. 
Arch Microbiol 142: 180-184]. 

In flowers and fruits carotenoids facilitate the attraction of pollinators and 

35 dispersal of seeds. This latter aspect is strongly associated with agriculture. The 
type and degree of pigmentation in fruits and flowers are among the most 
important traits of many crops. This is mainly since the colors of these products 
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often determine their appeal to the. consumers and thus can increase their market 
worth. 

Carotenoids have important commercial uses as coloring agents in the food 
industry since they are non-toxic [see, Bauemfeind JC (1981) Carotenoids as 

5 colorants and vitamin A precursors. Academic Press, London]. The red color of 
the tomato fruit is provided by lycopene which accumulates during fruit ripening 
in chromoplasts. Tomato extracts, which contain high content (over 80% dry 
weight) of lycopene, are commercially produced worldwide for industrial use as 
food colorant. Furthermore, the flesh, feathers or eggs of fish and birds assume the 

10 color of the dietary carotenoid provided, and thus carotenoids are frequently used 
in dietary additives for poultry and in aquaculture. Certain cyanobacterial species, 
for example Spirulina sp. [see, Sommer TR, Potts WT and Morrissy NM (1990) 
Recent progress in processed microalgae in aquaculture. Hydrobiologia 204/205: 
435-443], are cultivated in aquaculture for the production of animal and human 

15 food supplements. Consequently, the content pf carotenoids, primarily of p- 
carotene, in these cyanobacteria has a major commercial implication in 
biotechnology. 

Most carotenoids are composed of a C40 hydrocarbon backbone, 

constructed from eight C5 isoprenoid units and contain a series of conjugated 

20\ double bonds. Carotenes do not contain oxygen atoms and are either linear or 

/ cyclized molecules containing one or two end rings. Xanthophylls are oxygenated 

derivatives of carotenes. Various glycosilated carotenoids and carotenoid esters 

^\ 

have been identified. The C40 backbone can be ftirther extended to give C45 or 

C50 carotenoids, or shortened yielding apocarotenoids. Some nonphotosynthetic 

25 bacteria also synthesize C30 carotenoids. General background on carotenoids can 

be found in Goodwin TW (1980) The Biochemistry of the Carotenoids, Vol. 1, 2nd 

Ed. Chapman and Hall, New York; and in Goodwin TW and Britton G (1988) 

Distribution and analysis of carotenoids. In: Goodwin TW (ed) Plant Pigments, pp 

62-132. Academic Press, New York. 

30 More than 640 different naturally-occurring carotenoids have been so far 

characterized, hence, carotenoids are responsible for most of the various shades of 

yellow, orange and red found in microorganisms, fungi, algae, plants and animals. 

Carotenoids are synthesized by all photosynthetic organisms as well as several 

nonphotosynthetic bacteria and fungi, however they are also widely distributed 

35 through feeding throughout the animal kingdom. 

Carotenoids are synthesized de novo from isoprenoid precursors only in 

photosynthetic organisms and some microorganisms, they typically accumulate in 
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protein complexes in the photosynthetic membrane, in the cell membrane and in 
the cell wall. 

As detailed in Figure 1, in the biosynthesis pathway of p-carotene, four 
enzymes convert geranylgeranyl pyrophosphate of the central isoprenoid pathway 
to P-carotene. Carotenoids are produced from the general isoprenoid biosynthetic 
pathway. While this pathway has been known for several decades, only recently, 
and mainly through the use of genetics and molecular biology, have some of the 
molecular mechanisms involved in carotenoids biogenesis, been elucidated. This 
is due to the fact that most of the enzymes Vs/hich take part in the conversion of 
phytoene to carotenes and xanthophylls are labile, membrane-associated proteins 
that lose activity upon solubilization [see, Beyer P, Weiss G and Kleinig H (1985) 
Solubilization and reconstitution of the membrane-bound caroteaogenic enzymes 
from daffodile chromoplasis. Eur J Biochem 153: 341-346; and, Bramley PM 
(1985) The in vitro biosynthesis of carotenoids. Adv Lipid Res 21: 243-279]. 
However, solubilization of carotenogenic enzymes from Synechocystis sp. strain 
FCC 6714 that retain partial activity has been reported [see, Bramley PM and 
Sandmann G (1987) Solubilization of carotenogenic enzyme of Aphanocapsa, 
Phytochem 26: 1935-1939]. There is no genuine in vitro system for carotenoid 
biosynthesis which enables a direct essay of enzymatic activities. A ceil-free 
carotenogenic system has been developed [see, Clarke IE, Sandmann G, Bramley 
PM and Boger P (1982) Carotene biosynthesis with isolated photosynthetic 
membranes. FEBS Lett 140: 203-206] and adapted for cyanobacteria [see, 
Sandmann G and Bramley PM (1985) Carotenoid biosynthesis by Aphanocapsa 
homogenates coupled to a phytoene- generating system from Pl^ycomyces 
blakesleeanus, Planta 164: 259-263; and, Bramley PM and Sandmann G (1985) In 
vitro and in vivo biosynthesis of xanthophylls by the cyanobacterium 
Aphanocapsa. Phytochem 24: 2919-2922]. Reconstitution of phytoene desaturase 
from Synechococcus sp. strain PCC 7942 in liposomes was achieved following 
purification of the polypeptide, that had been expressed in Escherichia coli [see, 
Fraser PD, Linden H and Sandmann G (1993) Purification and reactivation of 
recombinant Synechococcus phytoene desaturase from an overexpressing strain of 
Escherichia coli. Biochem J 29 1 : 687-692]. 

Referring now to Figure 1, carotenoids are synthesized from isoprenoid 
precursors. The central pathway of isoprenoid biosynthesis may be viewed as 
beginning with the conversion of acetyl-CoA to mevalonic acid. D^-isopentenyl 
pyrophosphate (IFF), a C5 molecule, is formed from mevalonate and is the 
building block for all long-chain isoprenoids. Following isomerization of IPP to 
dimethylallyl pyrophosphate (DMAPP), three additional molecules of IPP are 
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combined to yield the C20 molecule, geranylgeranyl pyrophosphate (GGPP). 
These r-4 condensation reactions are catalyzed by prenyl transferases [see, 
Kleinig H (1989) The role of plastids in isoprenoid biosynthesis. Ann Rev Plant 
Physiol Plant Mol Biol 40: 39-59]. There is evidence in plants that the same 
enzyme, GGPP synthase, carries out all the reactions from DMAPP to GGPP [see, 
Dogbo O and Camara B (1987) Purification of isopentenyl pyrophosphate 
isomerase and geranylgeranyl pyrophosphate synthase from 'Capsicum 
chromoplasts by affinity chromatography, Biochim Biophys Acta 920: 140-148; 
and, Laferriere A and Beyer P (1991) Purification of geranylgeranyl diphosphate 
synthase from Sinapis alba etioplasts. Biochim Biophys Acta 216: 1 56-163], 

The first step that is specific for carotenoid biosynthesis is the head-to-head 
condensation of two molecules of GGPP to produce prephytoene pyrophosphate 
(PPPP). Following removal of the pyrophosphate, GGPP is converted to 
phytoene, a colorless C40 hydrocarbon molecule. This two-step reaction is 
catalyzed by the soluble enzyme, phytoene synthase, an enzyme encoded by a 
single gene (cr/B), in both cyanobacteria and plants [see, Chamovitz D, Misawa N, 
Sandmann G and Hirschberg J (1992) Molecular cloning and expression in 
Escherichia coli of a cyanobacterial gene coding for phytoene synthase, a 
carotenoid biosynthesis enzyme. FEBS Lett 296: 305-310; Ray JA, Bird CR, 
Maunders M, Grierson D and Schuch W (1987) Sequence of pTOM5, a ripening 
related cDNA from tomato. Nucl Acids Res 15: 10587-10588; Camara B (1993) 
Plant phytoene synthase complex - component 3 enzymes, immunology, and 
biogenesis. Meth Enzymol 214: 352-365], All the subsequent steps in the pathway 
occur in membranes. Four desaturation (dehydrogenation) reactions convert 
phytoene to lycopene via phytofluene, il^-carotene, and neurosporene. Each 
desaturation increases the number of conjugated double bonds by two such that 
the number of conjugated double bonds increases from three in phytoene to eleven 
in lycopene. 

Relatively little is known about the molecular mechanism of the enzymatic 
dehydrogenation of phytoene [see, Jones BL and Porter JW (1986) Biosynthesis of 
carotenes in higher plants. CRC Grit Rev Plant Sci 3: 295-324; and, Beyer P, 
Mayer M and Kleinig H (1989) Molecular oxygen and the state of geometric 
iosomerism of intermediates are essential in the carotene desaturation and 
cyclization reactions in daffodil chromoplasts. Eur J Biochem 184: 141-150]. It 
has been established that in cyanobacteria, algae and plants the first two 
desaturations, from 1 5-c/5-phytoene to <!^-carotene, are catalyzed by a single 
membrane-bound enzyme, phytoene desaturase [see, Jones BL and Porter JW 
(1986) Biosynthesis of carotenes in higher plants. CRC Crit Rev Plant Sci 3: 295- 
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324; and, Beyer P, Mayer M and Kleinig H (1989) Molecular oxygen and the state 
of geometric iosomerism of intermediates are essential in the carotene desaturation 
and cyclization reactions in daffodil chromoplasts. Eur J Biochem 184: 141-150], 
Since the (!^-carotene product is mostly in the edUtrans configuration, a cis-trans 

5 isomerization is presumed at this desaturation step. The primary structure of the 
phytoene desaturase polypeptide in cyanobacteria is conserved (over 65% identical 
residues) with that of algae and plants [see, Pecker 1, Chamovitz D, Linden H, 
Sandmann G and Hirschberg J (1992) A single polypeptide catalyzing the 
conversion of phytoene to f^-carotene is transcriptionally regulated during tomato 

10 fruit ripening. Proc Natl Acad Sci USA 89: 4962-4966; Pecker 1, Chamovitz D, 
Mann V, Sandmann G, Boger P and Hirschberg J (1993) Molecular 
characterization of carotenoid biosynthesis in plants: the phytoene desaturase gene 
in tomato. In: Murata N (ed) Research in Photosynthesis, Vol III, pp 11-18. 
Kluwer, Dordrectht]. Moreover, the same inhibitors block phytoene desaturase in 

15 the two systems [see, Sandmann G and Boger P (1989) Inhibition of carotenoid 
biosynthesis by herbicides. In: Boger P and Sandmann G (eds) Target Sites of 
Herbicide Action, pp 25-44. CRC Press, Boca Raton, Florida]. Consequently, it is 
very likely that the enzymes catalyzing the desaturation of phytoene and 
phytofluene in cyanobacteria and plants have similar biochemical and molecular 

20 properties, that are distinct from those of phytoene desaturases in other 
microorganisms. One such a difference is that phytoene desaturases from 
Rhodobacter capsulatus, Erwinia sp. or fungi convert phytoene to neurosporene, 
lycopene, or 3,4-dehydrolycopene, respectively. 

Desaturation of phytoene in daffodil chromoplasts [see, Beyer P, Mayer M 

25 and Kleinig H (1989) Molecular oxygen and the state of geometric iosomerism of 
intermediates are essential in the carotene desaturation and cyclization reactions in 
daffodil chromoplasts. Eur J Biochem 184: 141-150], as well as in a cell free 
system of Synechococcus sp. strain FCC 7942 [see, Sandmann G and Kowalczyk S 
(1989) In vitro carotenogenesis and characterization of the phytoene desaturase 

30 reaction in Anacystis. Biochem Biophys Res Com 163: 916-921], is dependent on 
molecular oxygen as a possible final electron acceptor, although oxygen is not 
directly involved in this reaction. A mechanism of dehydrogenase-electron 
transferase was supported in cyanobacteria over dehydrogenation mechanism of 
dehydrogenase-monooxygenase [see, Sandmann G and Kowalczyk S (1989) In 

35 vitro carotenogenesis and characterization of the phytoene desaturase reaction in 
Anacystis, Biochem Biophys Res Com 163: 916-921]. A conserved FAD-binding 
motif exists in all phytoene desaturases whose primary structures have been 
analyzed [see. Pecker I, Chamovitz D, Linden H, Sandmann G and Hirschberg J 




wo 98/18910 PCT/US97/17819 

10 

(1992) A single polypeptide catalyzing the conversion of phytoene to i^-carotene is 
transcriptionally regulated during tomato fruit ripening. Proc Natl Acad Sci USA 
89: 4962-4966; Pecker I, Chamovitz D, Mann V, Sandmann G, Boger P and 
Hirschberg J (1993) Molecular characterization of carotenoid biosynthesis in 

5 plants: the phytoene desaturase gene in tomato. In: Murata N (ed) Research in 
Photosynthesis, Vol III, pp 11-18, Kluwer, Dordrectht]. The phytoene desaturase 
enzyme in pepper was shown to contain a protein-bound FAD [see, Hugueney P, 
Romer S, Kuntz M and Camara B (1992) Characterization and molecular cloning 
of a flavoprotein catalyzing the synthesis of phytofluene and d^-carotene in 

10 Capsicum chromoplasts. Eur J Biochem 209: 399-407]. Since phytoene desaturase 
is located in the membrane, an additional, soluble redox component is predicted. 
This hypothetical component could employ NAD(P)"^, as suggested [see, Mayer 
MP, Nievelstein V and Beyer P (1992) Purification and characterization of a 
NADPH dependent oxidoreductase from chromoplasts of Narcissus 

15 pseudonarcissus - a redox-mediator possibly involved in carotene desaturation. 
Plant Physiol Biochem 30: 389-398] or another electron and hydrogen carrier, such 
as a quinone. The cellular location of phytoene desaturase in Synechocystis sp. 
• strain PCC 6714 and Anabaena variabilis strain ATCC 29413 was determined 
with specific antibodies to be mainly (85%) in the photosynthetic thylakoid 

20 membranes [see, Serrano A, Gimenez P, Schmidt A and Sandmann G (1990) 
Immunocytochemical localization and functional determination of phytoene 
desaturase in photoautotrophic prokaryotes. J Gen Microbiol 136: 2465-2469]. 

In cyanobacteria algae and plants (^-carotene is converted to lycopene via 
neurosporene. Very little is known about the enzymatic mechanism, which is 

25 predicted to be carried out by a single enzyme [see. Linden H, Vioque A and 
Sandmann G (1993) Isolation of a carotenoid biosynthesis gene coding for C^- 
carotene desaturase from Anabaena PCC 7120 by heterologous complementation. 
FEMS Microbiol Lett 106: 99-104], The deduced amino acid sequence of C,- 
carotene desaturase in Anabaena sp. strain PCC 7120 contains a dinucleotide- 

30 binding motif that is similar to the one found in phytoene desaturase. 

Two cyclization reactions convert lycopene to p-carotene. Evidence has 
been obtained that in Synechococcus sp. strain PCC 7942 [see, Cunningham FX Jr, 
Chamovitz D, Misawa N, Gantt E and Hirschberg J (1993) Cloning and functional 
expression in Escherichia coli of a cyanobacterial gene for lycopene cyclase, the 

35 enzyme that catalyzes the biosynthesis of p-carotene. FEBS Lett 328: 130-138], as 
well as in plants [see, Camara B and Dogbo O (1986) Demonstration and 
solubilization of lycopene cyclase from Capsicum chromoplast membranes. Plant 
Physiol 80: 172-184], these two cyclizations are catalyzed by a single enzyme. 
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lycopene cyclase. This membrane-bound enzyme is inhibited by the triethylamine 
compounds, CPTA andMPTA [see, Sandmann G and Boger P (1989) Inhibition of 
carotenoid biosynthesis by herbicides. In: Boger P and Sandmann G (eds) Target 
Sites of Herbicide Action, pp 25-44. CRC Press, Boca Raton, Florida]. 

5 Cyanobacteria carry out only the P-cyclization and therefore do not contain e- 
carotene, 5-carotene and a-carotene and their oxygenated derivatives. The P-ring 
is formed through the formation of a "carbonium ion" intemiediate when the C-1,2 
double bond at the end of the linear lycopene molecule is folded into the position 
of the C-5,6 double bond, followed by a loss of a proton from C-6. No cyclic 

10 carotene has been reported in which the 7,8 bond is not a double bond. Therefore, 
full desaturation as in lycopene, or desaturation of at least half-molecule as in 
neurosporene, is essential for the reaction. Cyclization of lycopene involves a 
dehydrogenation reaction that does not require oxygen. The cofactor for this 
reaction is unknown. A dinucleotide-binding domain was found in the lycopene 

15 cyclase polj^peptide of Synechococciis sp. strain PCC 7942, implicating NAD(P) or 
FAD as coenzymes with lycopene cyclase. 

The addition of various oxygen-containing side groups, such as hydroxy-, 
methoxy-, oxo-, epoxy-, aldehyde or carboxylic acid moieties, form the various 
xanthophyll species. Litde is known about the formation of xanthophylls. 

20 Hydroxylation of p-carotene requires molecular oxygen in a mixed-function 
oxidase reaction. 

Clusters of genes encoding the enzymes for the entire pathway have been 
cloned from the purple photosynthetic bacterium Rhodobacter capsulatus [see, 
Armstrong GA, Alberti M, Leach F and Hearst JE (1989) Nucleotide sequence. 

25 organization, and nature of the protein products of the carotenoid biosynthesis 
gene cluster of Rhodobacter capsulatus. Mol Gen Genet 216: 254-268] and from 
the nonphotosynthetic bacteria Erwinia herbicola [see, Sandmann G, Woods WS 
and Tuveson RW (1990) Identification of carotenoids in Erwinia herbicola and in 
transformed Escherichia coli strain. FEMS Microbiol Lett 71: 77-82; Hundle BS, 

30 Beyer P, Kleinig H, Englert H and Hearst JE (1991) Carotenoids of Erwinia 
herbicola and an Escherichia coli HBlOl strain carrying the Erwinia herbicola 
carotenoid gene cluster. Photochem Photobiol 54: 89-93; and, Schnurr G, Schmidt 
A and Sandmann G (1991) Mapping of a carotenogenic gene cluster from Erwinia 
herbicola and functional identification of six genes. FEMS Microbiol Lett 78: 157- 

35 162] and Erwinia uredovora [see, Misawa N, Nakagawa M, Kobayashi K, 
Yamano S, Izawa I, Nakamura K and Harashima K (1990) Elucidation of the 
Erwinia uredovora carotenoid biosynthetic pathway by functional analysis of 
gene products in Escherichia coli. J Bacteriol 172: 6704-6712]. Two genes, a/-5 
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for GGPP synthase [see. Nelson MA, Morelli G, Carattoli A, Romano N and 
Macino G (1989) Molecular cloning of a Neurospora crassa carotenoid 
biosynthetic gene (albino~3) regulated by blue light and the products of the white 
collar genes, Mol Cell Biol 9: 1271-1276; and, Carattoli A, Romano N, Baliario P, 

5 Morelli G and Macino G (1991) The Neurospora crassa carotenoid biosynthetic 
gene (albino 3). J Biol Chem 266: 5854-5859] and al-J for phytoene desaturase 
[see, Schmidhauser TJ, Lauter FR, Russo VEA and Yanofsky C (1990) Cloning 
. sequencing and photoregulation qf a/-7, a carotenoid biosynthetic gene of 
Neurospora crassa. Mol Cell Biol 10: 5064-5070] have been cloned from the 

10 fungus Neurospora crassa. However, attempts at using these genes as 
heterologous molecular probes to clone the corresponding genes from 
cyanobacteria or plants were unsuccessful due to lack of sufficient sequence 
similarity. 



15 from cyanobacteria using a molecular-genetics approach. In the first step towards 
cloning the gene for phytoene desaturase, a number of mutants that are resistant to 
the phytoene-desaturase-specific inhibitor, norflurazon, were isolated in 
Synechococcus sp. strain PCC 7942 [see^ Linden H, Sandmann G, Chamovitz D, 
Hirschberg J and Boger P (1990) Biochemical characterization of Synechococcus 

20 mutants selected against the bleaching herbicide norflurazon. Pestic Biochem 
Physiol 36: 46-51]. The gene conferring norflurazon-resistance was then cloned 
by transforming the wild-type strain to herbicide resistance [see, Chamovitz D, 
Pecker I and Hirschberg J (1991) The molecular basis of resistance to the herbicide 
norflurazon. Plant Mol Biol 16: 967-974; Chamovitz D, Pecker I, Sandmann G, 

25 Boger P and Hirschberg J (1990) Cloning a gene for norflurazon resistance in 
cyanobacteria. Z Naturforsch 45c: 482-486]. Several lines of evidence indicated 
that the cloned gene, formerly called pds and now named crtP, codes for phytoene 
desaturase. The most definitive one was the functional expression of phytoene 
, desaturase activity in transformed Escherichia coli cells [see, Linden H, Misawa 

30 N, Chamovitz D, Pecker I, Hirschberg J and Sandmann G (1991) Functional 
complementation in Escherichia coli of different phytoene desaturase genes and 
analysis of accumulated carotenes. Z Naturforsch 46c: 1045-1051; and. Pecker U 
Chamovitz D, Linden H, Sandmann G and Hirschberg J (1992) A single 
polypeptide catalyzing the conversion of phytoene to t^-carotene is 

35 transcriptionally regulated during tomato fruit ripening. Proc Natl Acad Sci USA 
89: 4962-4966]. The crtP gene was also cloned from Synechocystis sp. strain 
PCC 6803 by similar methods [see, Martinez-Ferez IM and Vioque A (1992) 
Nucleotide sequence of the phytoene desaturase gene from Synechocystis sp. PCC 
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6803 and characterization of a new mutation which confers resistance to the 
herbicide norflurazon. Plant Mol Biol 18: 981-983]. 

The cyanobacterial crtP gene was subsequently used as a molecular probe 
for cloning the homologous gene from an alga [see. Pecker I, Chamovitz D, Mann 

5 V, Sandmann G, Boger P and Hirschberg J (1993) Molecular characterization of 
carotenoid biosynthesis in plants: the phytoene desaturase gene in tomato. In: 
Murata N (ed) Research in Photosynthesis, Vol III, pp 11-18. Kluwer, Dordrectht] 
and higher plants [see, Hartley GE, Viitanen PV, Pecker I, Chamovitz D, 
Hirschberg J and Scolnik PA (1991) Molecular cloning and expression in 

10 photosynthetic bacteria of a soybean cDNA coding for phytoene desaturase, an 
enzyme of the carotenoid biosynthesis pathway. Proc Natl Acad Sci USA 88: 
6532-6536; and, Pecker I, Chamovitz D, Linden H, Sandmann G and Hirschberg J 
(1992) A single polypeptide catalyzing the conversion of phytoene to (^-carotene is 
transcriptionally regulated during tomato fruit ripening. Proc Natl Acad Sci USA 

15 89: 4962-4966]. The phytoene desaturases in Synechococcits sp. strain PCC 7942 
and Synechocystis sp. strain PCC 6803 consist of 474 and 467 amino acid residues, 
respectively, whose sequences are highly conserved (74% identities and 86% 
similarities). The calculated molecular mass is. 51 kDa and. although it is slightly 
hydrophobic (hydropathy index -0.2), it does not include a hydrophobic region 

20 which is long enough to span a lipid bilayer membrane. The primary structure of 
the cyanobacterial phytoene desaturase is highly conserved v/ith the enzyme from 
the green alga Dunalliela bardawil (61% identical and 81% similar; [see. Pecker I, 
Chamovitz D, Mann V, Sandmann G, Boger P and Hirschberg J (1993) Molecular 
characterization of carotenoid biosynthesis in plants: the phytoene desaturase gene 

25 in tomato. In: Murata N (ed) Research in Photosynthesis, Vol III, pp 11-18. 
Kluwer, Dordrectht]) and from tomato [see, Pecker I, Chamovitz D, Linden H, 
Sandmann G and Hirschberg J (1992) A single polypeptide catalyzing the 
conversion of phytoene to i^-carotene is transcriptionally regulated during tomato 
fruit ripening. Proc Natl Acad Sci USA 89: 4962-4966], pepper [see, Hugueney P, 

30 Romer S, Kuntz M and Camara B (1992) Characterization and molecular cloning 
of a flavoprotein catalyzing the synthesis of phytofluene and (^-carotene in 
Capsicum chromoplasts. Eur J Biochem 209: 399-407] and soybean [see, Bartley 
GE, Viitanen PV, Pecker I, Chamovitz D, Hirschberg J and Scolnik PA (1991) 
Molecular cloning and expression in photosynthetic bacteria of a soybean cDNA 

35 coding for phytoene desaturase, an enzyme of the carotenoid biosynthesis 
pathway. Proc Natl Acad Sci USA 88: 6532-6536] (62-65% identical and --79% 
similar; [see, Chamovitz D (1993) Molecular analysis of the early steps of 
carotenoid biosynthesis in cyanobacteria: Phytoene synthase and phytoene 
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desaturase. Ph.D. Thesis, The Hebrew University of Jerusalem]). The eukaryotic 
phytoene desaturase polypeptides are larger (64 kDa); however, they are processed 
during import into the plastids to mature forms whose sizes are comparable to 
. those of the cyanobacterial enzymes. 

5 There is a high degree of structural similarity in carotenoid enzymes of 

Rhodobacter capsulatus, Erwinia sp. and Neurospora crassa [reviewed in 
Armstrong GA, Hundle BS and Hearst JE (1993) Evolutionary conservation and 
structural similarities of carotenoid biosynthesis gene products from 
photosynthetic and nonphotosynthetic organisms. Meth Enzymol 214: 297-3 1 1], 

10 including in the crti gene-product, phytoene desaturase. As indicated above, a 
high degree of conservation of the primary structure of phytoene desaturases also 
exists among oxygenic photosynthetic organisms. However, there is little 
sequence similarity, except for the FAD binding sequences at the amino termini, 
between the "plant-type" crtP gene products and the "bacterial-type" phytoene 

15 desaturases (cr//gene products; 19-23% identities and 42A7% similarities). It has 
been hypothesized that crtP and crtI are not derived from the same ancestral gene 
and that they originated independently through convergent evolution [see, Pecker 
I, Chamovitz D, Linden H, Sandmann .G and Hirschberg J (1992) A single 
polypeptide catalyzing the. conversion of phytoene to ^-carotene is 

20 transcriptionally regulated during tomato fruit ripening. Proc Natl Acad Sci USA 
89: 4962-4966]. This hypothesis is supported by the different dehydrogenation 
sequences that are catalyzed by the two types of enzymes and by their different 
sensitivities to inhibitors. 

Although not as definite as in the case of phytoene desaturase, a similar 

25 distinction between cyanobacteria and plants on the one hand and other 
microorganisms is also seen in the structure of phytoene synthase. The crtB gene 
(formerly psy) encoding phytoene synthase was identified in the genome of 
Synechococcus sp. strain PCC 7942 adjacent to crtP and within the same operon 
[see, Bartley GE, Viitanen PV, Pecker I, Chamovitz D, Hirschberg J and Scolnik 

30 PA (1991) Molecular cloning and expression in photosynthetic bacteria of a 
soybean cDNA coding for phytoene desaturase, an enzyme of the carotenoid 
biosynthesis pathway. Proc Natl Acad Sci USA 88: 6532-6536]. This gene 
encodes a 36-kDa polypeptide of 307 amino acids with a hydrophobic index of - 
0.4. The deduced amino acid sequence of the cyanobacterial phytoene synthase is 

35 highly conserved with the tomato phytoene synthase (57% identical and 70% 
similar; Ray JA, Bird CR, Maunders M, Grierson D and Schuch W (1987) 
Sequence of pTOM5, a ripening related cDNA from tomato. Nucl Acids Res 15: 
10587-10588]) but is less highly conserved with the crtB sequences firom other 
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bacteria (29-32% identical and 48-50% similar with ten gaps in the alignment). 
Both types of enzymes contain two conserved sequence motifs also found in 
prenyl transferases from diverse organisms [see, Bartley GE, Viitanen PV, Pecker 
1, Chamovitz D, Hirschberg J and Scolnik PA (1991) Molecular cloning and 

5 expression in photosynthetic bacteria of a soybean cDNA coding for phytoene 
desaturase, an enzyme of the carotenoid biosynthesis pathway. Proc Natl Acad Sci 
USA 88: 6532-6536; Carattoli A, Romano N, Ballario P, Morelli G and Macino G 
(1991) The Neurospora crassa carotenoid biosynthetic gene (albino 3). J Biol 
Chem 266: 5854-5859; Armstrong GA, Hundle BS and Hearst JE (1993) 

10 Evolutionary conservation and structural similarities of carotenoid biosynthesis 
gene products from photosynthetic and nonphotosynthetic organisms. Meth 
Enzymol 214: 297-311; Math SK, Hearst JE and Poulter CD (1992) The crtE gene 
in Erwinia herbicola encodes geranylgeranyl diphosphate synthase. Proc Natl 
Acad Sci USA 89: 6761-6764; and, Chamovitz D (1993) Molecular analysis of the 

15 early steps of carotenoid biosynthesis in cyanobacteria: Ph^^toene synthase and 
phytoene desaturase. Ph.D. Thesis, The Hebrew University of Jerusalem]. It is 
conceivable that these regions in the polypeptide are involved in the binding and/or 
removal of the pyrophosphate during the condensation of two GGPP molecules. 

The crtQ gene encoding ^^-carotene desaturase (formerly zds) Vs^as cloned 

20 from Anabaena sp. strain PCC 7120 by screening an expression library of 
cyanobacterial genomic DNA in cells of Escherichia coli carrying the Erwinia sp. 
crtB and crtE genes and the cyanobacterial crtP gene [see, Linden H, Vioque A 
and Sandmann G (1993) Isolation of a carotenoid biosynthesis gene coding for C,- 
carotene desaturase from Anabaena PCC 7120 by heterologous complementation. 

25 FEMS Microbiol Lett 106: 99-104]. Since these Escherichia coli cells produce C,- 
carotene, brownish-red pigmented colonies that produced lycopene could be 
identified on the yellowish background of cells producing ^-carotene. The 
predicted ^-carotene desaturase from Anabaena sp. strain PCC 7120 is a 56-kDa 
polypeptide which consists of 499 amino acid residues. Surprisingly, its primary 

30 structure is not conserved with the "plant-type" {crtP gene product) phytoene 
desaturases, but it has considerable sequence similarity to the bacterial-type 
enzyme {crti gene product) [see, Sandmann G (1993) Genes and enzymes 
involved in the desaturation reactions from phytoene to lycopene. (abstract), 10th 
International Symposium on Carotenoids, Trondheim CLl-2]. It is possible that 

35 the cyanobacterial crtQ gene and crtI gene of other microorganisms originated in 
evolution from a conunon ancestor. 

The crtL gene for lycopene cyclase (formerly Icy) was cloned from 
Synechococciis sp. strain PCC 7942 utilizing essentially the same cloning strategy 
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as for crtP. By using an inhibitor of lycopene cyclase, 2-(4-methylphenoxy)- 
triethylamine hydrochloride (MPTA), the gene was isolated by transformation of 
the wild-type to herbicide-resistance [see, Cunningham FX Jr, Chamovitz D, 
Misawa N, Gantt E and Hirschberg J (1993) Cloning and functional expression in 
Escherichia coli of a cyanobacterial gene for lycopene cyclase, the enzyme that 
catalyzes the biosynthesis of p-carotene. FEBS Lett 328: 130-138]. Lycopene 
cyclase is the product of a single gene product and catalyzes the double cyclization 
reaction of lycopene to P-carotene. The crtL gene product in Synechococcus sp. 
strain PCC 7942 is a 46-kDa polypeptide of 411 amino acid residues. It has no 
sequence similarity to the crtY gene product (lycopene cyclase) from Erwinia 
tiredovora or Erwinia herbicola. 

The gene for P-carotene hydroxylase {crtZ) and zeaxanthin glycosilase 
{crtX) have been cloned from Erwinia . herb icola [see, Hundle B, Alberti M, 
Nievelstein V, Beyer P, Kleinig H, Armstrong GA, Burke DH and Hearst JE 
(1994). Functional assignment of ErM>inia herbicola EholO carotenoid genes 
expressed in Escherichia coli, Mol Gen Genet 254: 406-416; Hundle BS, Obrien 
DA, Alberti Beyer P and Hearst JE (1;992) Functional expression of zeaxanthin 
glucosyltransferase from Erwinia herbicola and a proposed diphosphate binding 
site. Pi'oc Natl Acad Sci USA 89; 9321-9325] and from Erwinia uredovora [see, 
Misawa N, Nakagawa M, Kobayashi K, Yamano S, Izawa I, Nakamura K and 
Harashima K (1990) Elucidation of the Erwinia uredovora carotenoid biosynthetic 
pathway by functional analysis of gene products in Escherichia coli, J Bacteriol 
172: 6704-6712]. 

The ketocarotenoid astaxanthin ; (3,3'-dihydroxy-P,P-carotene-4,4'-dione) 
was first described in aquatic crustaceans as an oxidized form of p-carotene. 
Astaxanthin was later found to be very common in many marine animals and 
algae. However, only few animals can synthesize astaxanthin de novo from other 
carotenoids and most of them obtain it in their food. In the plant kingdom, 
astaxanthin occurs mainly in some species of cyanobacteria, algae and lichens. 
However, it is found rarely also in petals of higher plant species [see, Goodwin 
TW (1980) The Biochemistry of the carotenoids. Vol. 1. 2nd Ed, Chapman and 
Hall, London and New York]. 

The function of astaxanthin as a powerful antioxidant in animals has been 
demonstrated [see, Miki W (1991) Biological functions and activities of animal 
carotenoids. Pure Appl Chem 63: 141]. Astaxanthin is a strong inhibitor of lipid 
peroxidation and has been shown to play an active role in the protection of 
biological membranes from oxidative injury [see, Palozza P and Krinsky NI (1992) 
Antioxidant effects of carotenoids in vivo and in vitro - an overview. Methods 
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Enzymol 213: 403-420; and, Kurashige M, Okimasu E, Inove M and Utsumi K 
(1990) Inhibition of oxidative injury of biological membranes by astaxanthin. 
Physiol Chem Phys Med NMR 22: 27]. The chemopreventive effects of 
astaxanthin have also been investigated in which astaxanthin was shown to 

5 significantly reduce the incidence of induced urinary bladder cancer in mice [see, 
Tanaka T, Morishita Y, Suzui M, Kojima T, Okumura A. and Mori H (1994). 
Chemoprevention of mouse urinary bladder carcinogenesis by the naturally 
occurring carotenoid astaxanthin. Carcinogenesis 15: 15]. It has also been 
demonstrated that astaxanthin exerts immunomodulating effects by enhancing 

10 antibody production [see, Jyonouchi H, Zhang L and Tomita Y (1993) Studies of 
immunomodulating actions of carotenoids. II. Astaxanthin enhances in vitro 
antibody production to T-dependent antigens without facilitating polyclonal B-cell 
activation. Nutr Cancer 19: 269; and, Jyonouchi H, Hill JR, Yoshifiimi T and 
Good R A (1991) Studies of immunomodulating actions of carotenoids. I. Effects 

15 of P-carotene and astaxanthin on murine lymphocyte functions and cell surface 
marker expression in-vitro culture system. Nutr Cancer 16: 93]. The complete 
biomedical properties of astaxanthin remain to be elucidated, but initial results 
suggest that it could play an important role in cancer and tumor prevention, as 
well as eliciting a positive response from the immune system. 

20 Astaxanthin is the principal carotenoid pigment of salmonids and shrimps 

and imparts attractive pigmentation in the eggs, flesh and skin [see, Torrisen OJ, 
Hardy RW, Shearer KD (1989) Pigmentation of saimonid-carotenoid deposition 
and metabolism in salmonids.. Crit Rev Aquatic Sci 1: 209], The world-wide 
hai-vest of salmon in 1991 was approximately 720,000 MT., of which 25-30% 

25 were produced in a variety of aquaculture facilities [see, Meyers SP (1994) 
Developments in world aquaculture, feed formulations, and role of carotenoids. 
Pure Appl Chem 66: 1069]. This is set to increase up to 460,000 MT. by the year 
2000 [see, Bjomdahl T (1990) The Economics of Salmon Aquaculture. Blackwell 
Scientific, Oxford, pp. 1]. The red coloration of the salmonid flesh contributes to 

30 consumer appeal and therefore affects the price of the final product. Animals 
cannot synthesize carotenoids and they acquire the pigments through the food 
chain from the primary producers - marine algae and phytoplankton. Those grown 
in intensive culture usually suffer from suboptimal color. Consequently, 
carotenoid-containing nourishment is artificially added in aquaculture, at 

35 considerable cost to the producer. 

Astaxanthin is the most expensive commercially used carotenoid compound 
(todays- 1995 market value is of 2,500-3,500 $/kg). It is utilized mainly as 
nutritional supplement which provides pigmentation in a wide variety of aquatic 
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animals. In the Far-East it is used also for feeding poultry to yield a typical 
pigmentation of chickens. It is also a desirable and effective nontoxic coloring for 
the food industry and is valuable in cosmetics. Recently it was reported that 
astaxanthin is a potent antioxidant in humans and thus is a desirable food additive. 

Natural (3*^,3'^ astaxanthin is limited in availability. It is commercially 
extracted from some Crustacea species [see, Torrisen OJ, Hardy RW, Shearer KD 
(1989) Pigmentation of salmonid-carotenoid deposition and metabolism in 
salmonids. Crit Rev Aquatic Sci 1: 209]. The {3R,yR) stereoisomer of 
astaxanthin is produced from Phaffia [a yeast specie, see, Andrewes AG, Phaff HJ 
and Starr MP (1976) Carotenoids of Phaffia rhodozyma, a red-pigmented 
fermenting yeast. Phytochemistry Vol. 15, pp. 1003-1007]. Synthetic astaxanthin, 
comprising a 1:2:1 mixture of the (3iS',3'iS')-, {2>S,yRy and (37?,3*7?)-isomers is now 
manufactured by Hoffman-La Roche and sold at a high price (ca. $2,500/Kg) 
under the name "CAROPHYLL Pink" [see, Mayer FL (1994) Reflections on 
carotenoid synthesis. Pure & Appl Chem, Vol. 66, pp. 931-938]. Recently a novel 
gene involved in ketocompound biosynthesis, designated crtW was isolated from 
the marine bacteria Agrobacterium auranticacum and Alcaligenes PC-1 that 
prodvice ketocarotenoids such as astaxanthin. When the crrJ^ gene was introduced 
into engineered Eschrichia coli that accumulated p-carotene due to Erwinia 
carotenogenic genes, the Escherichia coli transformants synthesized canthaxanthin 
a precursor in the synthetic pathway of astaxanthin [see, Misawa N, Kajiwara S, 
Kondo K, Yolcoyama A, Satomi Y, Saito T, Miki W and Ohtani T (1995) 
Canthaxanthin biosynthesis by the conversion of methylene to keto groups in a 
hydrocarbon ^-carotene by a single gene. Biochemical and biophysical research 
communications Vol. 209, pp. 867-876]. It is therefore desirable to find a 
relatively inexpensive source of {3S^yS) astaxanthin to be used as a feed 
supplement in aquaculture and as a valuable chemical for various other industrial 
uses. 

Although astaxanthin is synthesized in a variety of bacteria, fungi and algae, 
the key limitation to the use of biological systems for its production is the low 
yield of and costly extraction methods in these systems compared to chemical 
synthesis. One way to solve these problems is to increase the productivity of 
astaxanthin production in biological systems using recombinant DNA technology. 
This allows for the production of astaxanthin in genetically engineered host which, 
in the case of a higher plant, is easy to grow and simple to extract. Furthermore, 
production of astaxanthin in genetically engineered host enables by appropriate 
Jiost selection to use thus produced astaxanthin in for example aquaculture 
applications, devoid of the need for extraction. 
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There is thus a widely recognized need for, and it would be highly 
advantageous to have, a nucleic acid segment which encodes p-C-4-oxygenase, the 
enzyme that converts P-carotene to canthaxanthin, as well as recombinant vector 
molecules comprising a nucleic acid sequence according to the invention, and host 
cells or transgenic organisms transformed or transfected with these vector 
molecules or DNA segment for the biotechnological production of (3*5,3 'iS) 
astaxanthin. 

Other features and advantages of the invention will be apparent from the 
following description and from the claims. 

It is a general object of this invention to provide a biotechnological method 
for production of (3iS',3'^ astaxanthin. 

It is a specific object of the invention to provide a peptide having a p-C-4- 
oxygenase activity and a DNA segment coding for this peptide to enable a 
biotechnological production of astaxanthin and other xanthophylls. 

It is a further object of the invention to provide an RNA segments coding 
for a polypeptide comprising an amino acid sequence corresponding to above 
described peptide. 

It is yet a further object of the invention to provide a recombinant DNA 
molecule comprising a vector and the DNA segment as described above. 

It is still a further object of the invention to provide a host cell containing 
the above described recombinant DNA molecule. 

. It is another object of the invention to provide a host transgenic organism 
containing the above described recombinant DNA molecule or the above described 
DNA segment in its cells. 

It is still another object of the invention to provide a host transgenic 
organism which expresses p-C-4-oxygenase activity in chloroplasts and/or 
chromoplasts-containing tissues. 

It is yet another object of the invention to provide a food additive for animal 
or human consumption comprising the above described host cell or transgenic 
organism. 

It is still another object of the invention to provide a method of producing 
astaxanthin using the above described host cell or transgenic organism. 

It is a further object of the invention to provide a method of producing 
canthaxanthin, echinenone, cryptoxanthin, isocryptoxanthin hydroxyechinenone, 
zeaxanthin, adonirubin, and/or adonixanthin using the above described host cell or 
transgenic organism. 
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Further objects and advantages of the present invention will be clear from 
the description that follows. 

In one embodiment, the present invention relates to a DNA segment coding 
for a polypeptide comprising an amino acid sequence corresponding to 
Haematococcus pluvialis crtO gene. 

In a further embodiment, the present invention relates to an RNA segment 
coding for a polypeptide comprising an amino acid sequence corresponding to 
Haematococcus pluvialis crtO gene. 

In yet another embodiment, the present invention relates to a polypeptide 
comprising an amino acid sequence corresponding to a Haematococcus pluvialis 
crtO gene. 

In a further embodiment, the present invention relates to a recombinant 
DNA molecule comprising a vector and a DNA segment coding for a polypeptide, 
corresponding to a Haematococcus pluvialis crtO gene. 

In another embodiment, the present . invention relates to a host cell 
containing the above described recombinant DNA molecule or DNA segment. 

In a further embodiment, the present invention relates to a host transgenic 
organism containing the abovq described recombinant DNA molecule or the above 
described DNA segment in its cells. 

In another embodiment, the present invention relates to a method of 
producing astaxanthin using the above described host cell or transgenic organism. 

In yet another embodiment, the present invention relates to a method of 
producing other xanthophylls. 

In still another embodiment, the present invention relates to a method of 
obtaining high expression of a transgene in plants specifically in chromoplasts- 
containing cells. 

In one further embodiment, the present invention relates to a method of 
importing a carotenoid-biosynthesis enzyme encoded by a transgene into 
chromoplasts. 

BRIEF DESCRIPTION OF THR PR A WTN^S 

The invention herein described, by way of example only, with reference to 
the accompanying drawings, wherein: 

FIG. 1 is a general biochemical pathway of (^-carotene biosynthesis, in 
which pathway all molecules are depicted in an oW-trans configuration, wherein 
IPP is isopentenyl pyrophosphate, DMAPP is dimethylallyl pyrophosphate, GPP is 
geranyl pyrophosphate, FPP is farnesyl pyrophosphate, GGPP is geranylgeranyl 
pyrophosphate and, PPPP is prephytoene pyrophosphate; 
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FIG. 2 is an identity map between the nucleotide sequence of the crtO 
cDNA of the present invention (CRTOA.SEQ) and the cDNA cloned by Kajiwara 
et al., (CRTOJ.SEQ) [see, Kajiwara S, Kakizono T, Saito T, Kondo K, Ohtani T, 
Nishio N, Nagai S and Misawa N (1995) Isolation and functional identification of 
5 a novel cDNA for astaxanthin biosynthesis from Haematococcus pluvialis, and 
astaxanthin synthesis in Escherichia coli. Plant Molec Biol 29: 343-352], using a 
GCG software, wherein (:) indicate identity, (-) indicate a gap and nucleotides 
numbering is according to SEQ ID N0:4 for CRTOA.AMI and Kajiwara et al, for 
CRTO J. AMI; 

10 FIG. 3 is an identity map between the amino acid sequence encoded by the 

crtO cDNA of the present invention (CRTOA.AMI) and the amino acid sequence 
encoded by the cDNA cloned by Kajiwara et al., (CRTOJ.AMI) [see, Kajiwara S, 
Kakizono T, Saito T, Kondo K, Ohtani T, Nishio N, Nagai S and Misawa N (1995) 
Isolation and functional identification of a novel cDNA for astaxanthin 

15 biosynthesis from Haematococcus pluvialis, and astaxanthin synthesis in 
Escherichia coli. Plant Molec Biol 29: 343-352], using a GCG software, wherein 
(:) indicate identity, (-) indicate a gap and amino acids numbering is according to 
SEQ ID NO:4 for CRTOA.AMI and Kajiwara et al., for CRTOJ.AMI; 

FIG. 4 is a schematic depiction of a pACYC184 derived plasmid designated 

20 pBCAR and includes the genes crtE, crtB, crti and crtY of Erwinia herbicola, 
which genes are required for production of (J-carotene in Escherichia coli cells; 

FIG. 5 is a schematic depiction of a pACYC184 derived plasmid designated 
pZEAX and includes the genes crtE, crtB, crtI, crtY and crtZ from Erwinia 
herbicola, which genes are required for production of zeaxanthin in Escherichia 

25 coli cells; 

FIG. 6 is a schematic depiction of a pBluescriptSK" derived plasmid 
designated pHPK, containing a full length cDNA insert encoding a p -carotene C- 
4-oxygenase enzyme from Haematococcus pluvialis, designated crtO and set forth 
in SEQ ID NO:l, which cDNA was identified by color complementation of 

30 Escherichia coli cells; 

FIG. 7 is a schematic depiction of a pACYC184 derived plasmid designated 
pCANTHA which was derived by inserting a 1.2 kb Pstl-Pstl DNA fragment, 
containing the cDNA encoding the p-C-4-oxygenase from Haematococcus 
pluvialis isolated from the plasmid pHPK of Figure 6 and inserted into a Pstl site 

35 in the coding sequence of the crtZ gene in the plasmid pZEAX of Figure 5; this 
recombinant plasmid carries the genes crtE, crtB, crtI, crtY of Erwinia herbicola 
and the crtO gene of Haematococcus pluvialis, all required for production of 
canthaxanthin in Escherichia coli cells; 




wo 98/18910 PCT/US97/17819 

22 

FIG. 8 is a schematic depiction of a pACYCl 84 derived plasmid designated 
pASTA which was derived by inserting the 1.2 kb Pstl-Pstl DNA fragment, 
containing the cDNA of the (5-C-4-oxygenase from Haematococcus pluvialis 
isolated from the plasmid pHPK of Figure 6 and inserted into a Pstl site which 

5 exists 600 bp downstream of the crtE gene in the plasmid pZEAX of Figure 5; this 
recombinant plasmid carries the genes crtE^ crtB, crtU crtY, crtZ of Erwinia 
herbicola and the crtO gene of Haematococcus pluvialis^ all required for 
production of astaxanthin in Escherichia coli cells; 

FIG. 9 is a schematic depiction of a pBR328 derived plasmid designated 

10 PANS.S-ICETO which was derived by inserting the 1.2 kb Pstl-Pstl DNA 
fragment, containing the cDNA of the (i-C-4-oxygenase from Haematococcus 
pluvialis isolated from the plasmid pHPK of Figure 6 and inserted into a Pstl site 
which exists in a p-lactamase gene in a plasmid designated pPAN35D5 [described 
in Hirschberg J, Ohad Pecker I and Rahat A (1987) Isolation and 

15 characterization of herbicide resistant mutants in the cyanobacterium 
Synechococcus R2. Z. Naturforsch 42c: 102-112], which carries the psbAI gene 
from the cyanobacterium Synechococcus PCC7942 in the plasmid vector pBR328 
[see, Hirschberg J, Ohad N, Pecker I and Rahat A (1987) Isolation and 
characterization of herbicide resistant . mutants in the cyanobacterium 

20 Synechococcus R2. Z. Naturforsch 42c: 102-1 12]; this recombinant plasmid carries 
the crtO gene of Haematococcus pluvialis, required for production of astaxanthin 
in Synechococcus PCC7942 cells; 

FIG. 10 is a schematic depiction of the T-DNA region of a Ti binary 
plasmid {E. coli, Agrobacterium) designated pBIB [described by Becker D 

25 (1990) Binary vectors which allow the exchange of plant selectable markers 
and reporter genes. Nucleic Acids Research 18:230] which is a derivative of the 
Ti plasmid pBIlOl [described by Jeffesrson AR, Kavanagh TA and Bevan WM 
(1987) GUS fusions: p-glucuronidase as a sensitive and versatile gene fusion 
marker in higher plants. The EMBO J. 6: 3901-3907], wherein Br and Bl are 

30 the right and left borders, respectively, of the T-DNA region, pAg7 is the 
polyadenylation site of gene 7 of Agrobacterium Ti-plasmid, pAnos is a 250 bp 
long DNA fragment containing the poly adenylation site of the nopaline 
synthase gene of Agrobacterium, NPT II is a 1,800 bp long DNA fragment 
coding for kanamycin resistance, pnos is a 300 bp long DNA fragment 

35 containing the promoter sequence of the nopaline synthase gene of 
Agrobacterium, whereas pAnos is a 300 bp long DNA fragment containing the 
poly adenylation site of the nopaline synthase gene oi Agrobacterium, 



• 
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FIG. 1 1 is a schematic depiction of the T-DNA region of a Ti binary 
plasmid {E, coli, Agrobacterium) designated pPTBIB which was prepared by 
cloning a genomic DNA sequence of a tomato species Lycopersicon 
esculentum marked PT (nucleotides 1 to 1448 of the Pds gene as published in 

5 Mann V, Pecker I and Hirschberg J (1994) cloning and characterization of the 
gene for phytoene desaturase (Pds) from tomato {Lycopersicon esculentum). 
Plant Molecular Biology 24: 429-434), which contains the promoter of the Pds 
gene and the coding sequence for the amino terminus region of the polypeptide 
PDS that serve as a transit peptide for import into chloroplasts and 

10 chromoplasts, into a Hindlll-Smal site of the binary plasmid vector pBIB of 
Figure 10, wherein Br and Bl, pAg7, pAnos, NPT II, pnos and pAnos are as 
defined above; 

FIG. 12 is a schematic depiction of the T-DNA region of a Ti binary 
plasmid (E. coli, Agrobacterium) . designated pPTCRTGBIB which was 

15 prepared by cloning a 1,110 nucleotide long EcoAlWl-Ncol fragment of the 
cDNA of crtO from H. pluvialis (nucleotides 211 to 1321 of SEQ ID NO:l) 
into the Smal site of the plasmid pPTBIB of Figure 11, such that the coding 
nucleotide sequence of the amino terminus of PDS is in the same reading frame 
of crtO, wherein Br and Bl, pAg7, pAnos, NPT II, pnos, and pAnos are as 

20 defined above, PT is the promoter and transit peptide coding sequences of Pds 
from tomato and CRTO is the nucleotide sequence of crtO from H, pluvialis 
(nucleotides 211 to 1321 of SEQ ID N0:1); 

FIG. 13 shows a Southern DNA blot analysis of //miilll-digested genomic 
DNA extracted from wild type (WT) and crtO tobacco transgenic plants, 

25 designated 2, 3, 4, 6, 9 and 10, according to the present invention, using the crtO 
cDNA as a radioactive probe essentially as described in Sambrook et aL, 
Molecular Cloning; A Laboratory Manual. Cold Spring Harbor Laboratory, Cold 
Spring Harbor, N.Y. 1989, wherein the size of marker (M) DNA fragments in 
kilobase pairs (kb) is indicated on the left as well as the expected position (arrow) 

30 of an internal T-DNA Hindlll fragment as was deduced from the sequence of 
pPTPDSBIB shown in Figure 12 which contain the crtO cDNA sequence; 
FIG. 14 shows a biosynthesis pathway of astaxanthin; 
FIG. 15 shows a flower from a wild type tobacco plant and a flower from a 
transgenic tobacco plant according to the present invention. 



nFS<"l?TT>TTn>J OF THF prefer red EMBODIME NTS 

The present invention is, in general, of a biotechnological method for 
production of (35,3'iS) astaxanthin. In particular, the present invention is of a 
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peptide having a P-C-4-oxygenase activity; a DNA segment coding for this 
peptide; an RNA segments coding for this peptide; a recombinant DNA molecule 
comprising a vector and the DNA segment; a host cell or organism containing the 
above described recombinant DNA molecule or DNA segment; and of a method 
for biotechnologically producing {3S,yS) astaxanthin or a food additive containing 
{3S,yS) astaxanthin, using the host. 

The unicellular fresh-water green alga Haematococcus pluvialis 
accumulates large amounts of (3S,3'iS) astaxanthin when exposed to unfavorable 
growth conditions, or following different environmental stresses such as phosphate 
or nitrogen starvation, high concentration of salt in the growth medium or high 
light intensity [see, Yong YYR and Lee YK (1991) Phycologia 30 257-261; Droop 
MR (1954) Arch Microbiol 20: 391-397; and, Andrewes A.G, Borch G, Liaaen- 
Jensen S and Snatzke G.(1974) Acta Chem Scand B28: 730-736]. During this 
process, the vegetative cells of the alga form cysts and change their color from 
green to red. The present invention discloses the cloning of a cDNA from 
Haematococcus pluvialis^ designated crtO, which encodes a p-C-4-'0xygenase, the 
enzyme that converts P-carotene to canthaxanthin, and its expression in a 
heterologous systems expressing p-carotene hydroxylase (e.g., Erwinia herbicola 
crtZ gene product), leading to the production of {3S,yS) astaxanthin. 

The crtO cDNA and its encoded peptide having a p-C-4-oxygenase activity 
are novel nucleic and amino acid sequences, respectively. The cloning method of 
the crtO cDNA took advantage of a strain of Escherichia coli^ which v^as 
genetically engineered to produce p-carotene, to which a cDNA library of 
Haematococcus pluvialis was transfected and expressed. Visual screening for 
brown-red pigmented Escherichia coli cells has identified a canthaxanthin 
producing transformant. Thus cloned cDNA has been expressed in two 
heterologous systems {Escherichia coli and Synechococcus PCC7942 cells) both 
able to produce P-carotene and further include an engineered {Erwinia herbicola 
crtZ gene product) or endogenous P-carotene hydroxylase activity, and was shown 
to enable the production of (3iS,3'iS) astaxanthin in both these systems. 

The crtO cDNA or its protein product exhibit no meaningful nucleic- or 
amino acid sequence similarities to the nucleic- or amino acid sequence of crtW 
and its protein product isolated from the marine bacteria Agrobacterium 
auranticacum and Alcaligenes PC-1 that produce ketocarotenoids such as 
astaxanthin [see, Misawa N, Kajiwara S, Kondo K, Yokoyama A, Satomi Y, Saito 
T, Miki W and Ohtani T (1995) Canthaxanthin biosynthesis by the conversion of 
methylene to keto groups in a hydrocarbon p-carotene by a single gene. 
Biochemical and biophysical research communications Vol. 209, pp. 867-876]. 
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However, the crtO cDNA and its protein product exhibit substantial 
nucleic- and amino acid sequence identities with the nucleic- and amino acid 
sequence of a recently cloned cDNA encoding a 320 amino acids protein product 
having (i-carotene oxygenase activity, isolated from Haematococcus pluvialis [see, 

5 Kajiwara S, Kakizono T, Saito T, Kondo K, Ohtani T, Nishio N, Nagai S and 
Misawa N (1995) Isolation and functional identification of a novel cDNA for 
astaxanthin biosynthesis from Haematococcus pluvialis, and astaxanthin synthesis 
in Escherichia coli. Plant Molec Biol 29: 343-352]. Nevertheless, as presented in 
Figure 2 the degree of sequence identity between the crtO cDNA (CRTOA.SEQ in 

10 Figure 2) and the cDNA described by Kajiwara et aL (CRTOJ.SEQ in Figure 2) 
[see reference above] is 75.7% and, as presented in Figure 3 the degree of 
sequence identity between the crtO cDNA protein product (CRTOA.AMI in 
Figure 3) and the protein described by Kajiwara et al (CRTOJAMI in Figure 3) is 
78%, as was determined using a GCG software. 

15 As will be described in details hereinbelow, the crtO cDNA can thus be 

employed to biotechnologically produce (3*5^,3 '5^ astaxanthin in systems which are 
either easy to grow and can be used directly as an additive to fish food, or systems 
permitting a simple and low cost extraction procedure of astaxanthin. 

In one embodiment, the present invention relates to a DNA segment coding 

20 for a polypeptide comprising an amino acid sequence corresponding to 
Haematococcus pluvialis crtO gene and allelic and species variations and 
functional naturally occurring and/or man-induced variants thereof The phrase 
'allelic and species variations and functional naturally occurring and/or man- 
induced variants' as used herein and in the claims below refer to the source of the 

25 DNA (or RNA as described below) or means known in the art for obtaining it. 
However the terms 'variation' and Variants' indicate the presence of sequence 
dissimilarities (i.e., variations). It is the intention herein and in the claims below 
that the sequence variations will be 77-80%, preferably 80-85%, more preferably 
85-90%, most preferably 90-100% of identical nucleotides. In a preferred 

30 embodiment the DNA segment comprises the sequence set forth in SEQ ID NO:l. 
In another preferred embodiment, the DNA segment encodes the amino acid 
sequence set forth in SEQ ID NO:4. 

The invention also includes a pure DNA segment characterized as including 
a sequence which hybridizes under high stringency conditions [e.g., as described in 

35 Sambrook et aL, Molecular Cloning; A Laboratory Manual. Cold Spring Harbor 
Laboratory, Cold Spring Harbor, N.Y. 1989] to a nucleic acid probe which 
includes at least fifteen, preferably at least fifty, more preferably at least hundred, 
even more preferably at least two himdred, even more preferably at least five 




wo 98/18910 PCT/US97/17819 

26 

hundred successive nucleotides of SEQ ID NO:l or SEQ ID N0:2. Alternatively, 
the DNA segment of the invention may be characterized as being capable of 
hybridizing under low-stringent conditions to a nucleic acid probe which includes 
the coding sequence (nucleotides 166 through 1152) of SEQ ID NO:l or SEQ ID 
5 NO:2. An example of such low-stringency conditions is as described in Sambrook 
et aL, using a lower hybridization temperature, such as, for example, 20^C below 
the temperature employed for high-stringency hybridization conditions, as 
described above. 

The DNA segment of the invention may also be characterized as being 
10 capable of hybridizing under high-stringent conditions to a nucleic acid probe 
which includes the coding sequence (nucleotides 166 through 1152) of SEQ ID 
NO:l or SEQ ID NO:2. 

The invention also includes a synthetically produced oligonucleotide (e.g., 
oHgodeoxyribonucleotide or oligoribonuGleotide" and analogs thereof) capable of 
15 hybridizing with at least ten-nucleotide segments of SEQ ID NO:l or SEQ ID 
NO:2. 

In another embodiment, the present invention relates to an RNA segment 
coding for a polypeptide comprising an amino acid sequence corresponding to 
Haematococcus pluvialis crtO gene and allelic and species variations and 

20 fimctional naturally occurring and/or man-induced variants thereof. In a preferred 
embodiment the RNA segment comprises the sequence set forth in SEQ ID NO:2. 
In another preferred embodiment, the RNA segment encodes the amino acid 
sequence set forth in SEQ ID NO:4. 

The invention also includes a pure RNA characterized as including a 

25 sequence which hybridizes under high stringent conditions to a nucleic acid probe 
which includes at least at least fifteen, preferably at least fifty, more preferably at 
least hundred, even more preferably at least two hundred, even more preferably at 
least five hundred succsesive nucleotides of SEQ ID NO:l or SEQ ID NO:2. 
Alternatively, the RNA of the invention may be characterized as being capable of 

30 hybridizing under low-stringent conditions to a nucleic acid probe which includes 
the coding sequence (nucleotides 166 through 1 152) of SEQ ID NO:l or SEQ ID 
NO:2. Additionally, the RNA of the invention may be characterized as being 
capable of hybridizing under high-stringent conditions to a nucleic acid probe 
which includes the coding sequence (nucleotides 166 through 1152) of SEQ ID 

35 NO:l orSEQIDNO:2. 

In another embodiment, the present invention relates to a polypeptide 
comprising an amino acid sequence corresponding to a Haematococcus pluvialis 
crtO gene and allelic, species variations and functional naturally occurring and/or 
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man-induced variants thereof. In a preferred embodiment, the polypeptide 
comprises the amino acid sequence set forth in SEQ ID NO;4, 

It should be noted that the invention includes any peptide which is 
homologous (i.e., 80-85%, preferably 85-90%, more preferably 90-100% of 
identical amino acids) to the above described polypeptide. The temi 'homologous' 
as used herein and in the claims below, refers to the sequence identity between two 
peptides. When a position in both of the two compared sequences is occupied by 
identical amino acid monomeric subunits, it is homologous at that position. The 
homology between two sequences is a function of the number of homologous 
positions shared by the two sequences. For example, if eight often of the positions 
in two sequences are occupied by identical amino acids then the two sequences are 
80% homologous. 

Other polypeptides which are also included in the present invention are 
allelic variations, other species homologs, natural mutants, induced mutants and 
peptides encoded by DNA that hybridizes under high or low stringency conditions 
(see above) to the coding region (nucleotides 166 through 1152) of SEQ ID NO:l 
or SEQ IDNO:2. 

In another embodiment, the present invention relates to a recombinant DNA 
molecule comprising a vector (for example plasmid or viral vector) and a DNA 
segment coding for a polypeptide, as described above. In a preferred embodiment, 
the DNA segment is present in the vector operably linked to a promoter. 

In a further embodiment, the present invention relates to a host ceil 
containing the above described recombinant DNA molecule or DNA segment. 
Suitable host cells include prokaryotes (such as bacteria, including Escherichia 
coli) and both lower eukaryotes (for example yeast) and higher eukaryotes (for 
example, algae, plant or animal cells). Introduction of the recombinant molecule 
into the cell can be effected using methods known in the art such as, but not 
limited to, transfection, transformation, micro-injection, gene bombardment etc. 
The cell thus made to contain the above described recombinant DNA molecules 
may be grown to form colonies or may be made to differentiate to form a 
differentiated organism. The recombinant DNA molecule may be transiently 
contained (e.g., by a process known in the art as transient transfection) in the cell, 
nevertheless, it is preferred that the recombinant DNA molecule is stably contained 
(e.g., by a process known in the art as stable transfection) in the cell. Yet in a 
preferred embodiment the cell is endogenously producing, or is made by genetic 
engineering means to produce, P-carotene, and the cell contains endogenous or 
genetically engineered P-carotene hydroxylase activity. Such a cell may be used 
as a food additive for animal (e.g., salmon) and human consumption. Furthermore, 
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such a cell may be used for extracting astaxanthin and/or other xanthophylls, as 
described hereinbelow. 

In a further embodiment, the present invention relates to a host transgenic 
organism (e.g., a higher plant or animal) containing the above described 

5 recombinant DNA molecule or the above described DNA segment in its cells. 
Introduction of the recombinant molecule or the DNA segment into the host 
transgenic organism can be effected using methods known in the art. Yet, in a 
preferred embodiment the host organism is endogenously producing, or is made by 
genetic engineering means to produce, (i-carotene and, also preferably the host 

10 organism contains endogenous or genetically engineered p-carotene hydroxylase 
activity. Such an organism may be used as a food additive for animal (e.g., 
salmon) and human consumption. Furthermore, such an organism may be used for 
extracting astaxanthin and/or other xanthophylls, as described hereinbelow. 

In another embodiment, the present invention relates to a method of 

15 producing astaxanthin using the above described host cell or transgenic organism. 
In yet another embodiment, the present invention relates to a method of producing 
xanthophylls such as canthaxanthin, echinenone, cryptoxanthin, isocryptoxanthin, 
hydroxyechinenone, zeaxanthin, adonirubin, 3 -hydroxy echinenone, 3'- 
hydroxyechinenone and/or adonixanthin using the above described host cell or 

20 transgenic organism. For these purposes , provided is a cell or a transgenic 
organism as described above. The host ceH or organism are made to jgrow under 
conditions favorable of producing astaxanthin and the above listed additional 
xanthophylls which are than extracted by methods known in the art. 

In yet another embodiment, the present invention relates to a transgenic 

25 plant expressing a transgene coding for a polypeptide including an amino^acid 
sequence corresponding to Haematococcus pluvialis crtP gon^^^ allelic and species 
variants oF^lunctional naturally occurring or man- induced variants thereof'^ 
Preferably the expression is highest in chromoplasts-containing tissues. 

In yet another embodiment, the present invention relateslo a recombinant 

30 DNA vector which includes a first DNA segment encoding a polypeptide for 
directing a protein into plant chloroplasts or chromoplasts (e.g., derived from the 
Pds gene of tomato) and an in frame second DNA segment encoding a polypeptide 
including an amino acid sequence corresponding to Haematococcus pluvialis crtO 
gene, allelic and species variants or functional naturally occurring and man- 

35 induced variants thereof. 

In yet another embodiment, the present invention relates to a recombinant 
DNA vector which includes a first DNA segment including a promoter highly 
expressible in plant chloroplasts or chromoplasts-containing tissues (e.g., derived 
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from the Pds gene of tomato) and a second DNA segment encoding a polypeptide 
including an amino acid sequence corresponding to Haematococcus pluvialis crtO 
gene, allelic and species variants or functional naturally occurring and man- 
induced variants thereof. 

Reference in now made to the following examples, which together with the 
above descriptions, illustrate the invention. 

EXAMPLES 

The following protocols and experimental details are referenced in the 
Examples that follow: 

Algae and growth conditions. HaematocGccus pluvialis (strain 34/7 from 
the Culture Collection of Algae and Protozoa, Windermere, UK) was kindly 
provided by Dr. Andrew Young from the Liverpool John Moores University. 
Suspension cultures of the alga were grown in a liquid medium as described by 
Nichols and Bold [see, Nichols HW, Bold HC (1964) Trichsarcina polymorpha 
gen et sp nov J Phycol 1: 34-39]. For induction of astaxanthin biosynthesis cells 
were harvested, washed in water and resuspended in a nitrogen-depleted medium. 
The cultures were maintained in 250 ml Erlenmeyer flasks under continuous light 
(photon flux of 75 |j.E/ni2/s), at 25°C, on a rotary shaker at 80 rpm. 

Construction of cDNA library. The construction of a cDNA library from 
Haematococcus pluvialis was described in detail by Lotan and Hirschberg (1995) 
FEBS letters 364: 125-128. Briefly, total RNA was extracted from algal cells 
grown for 5 days under nitrogen-depleted conditions (cell color brown-red). Cells 
from a 50 ml culture were harvested and their RNA content was extracted using 
Tri reagent (Molecular Research Center, INC.)- Poly-An RNA was isolated by 
two cycles of fractionation on oligo dT-cellulose (Boehringer). The final yield was 
1.5% of the total RNA. The cDNA library was constructed in a Uni-ZAPTM XR 
vector, using a ZAP-cDNA synthesis kit (both from Stratagene). Escherichia coli 
cells of strain XL 1 -Blue MRF' (Stratagene) were used for amphfication of the 
cDNA library. 

Plasmids and Escherichia coli strains. Plasmid pPL376, which contains 
the genes necessary for carotenoid biosynthesis in the bacterium Erwinia herbicola 
was obtained from Tuveson [for further details regarding plasmid pPL376 see, 
Tuveson RW, Larson RA & Kagan J (1988) Role of cloned carotenoid genes 
expressed in Escherichia coli in protecting against inactivation by near-UV light 
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and specific phototoxic molecules. J Bacteriol 170: 4675-4680]. Cells of 
Escherichia coli strain JM109 that carry the plasmid pPL376 accumulate the bright 
yellow carotenoid, zeaxanthin glycoside. In a first step, a LI kb Sali-SaH 
iragment was deleted from this plasmid to inactivate the gene crtX, coding for 

5 zeaxanthin glucosyl transferase. In a second step, partial 5amHI cleavage of the 
plasmid DNA, followed by self ligation, deleted a 0.8 kb fragment which 
inactivated crtZ^ encoding p-carotene hydroxylase. A partial BglW cleavage 
generated a fragment of 7.4 kb which was cloned in the BamHI site of the plasmid 
vector pACYC184. As shown in Figure 4, the resulting recombinant plasmid, 

10 which carried the genes crtE, crtB, crti and crtY, was designated pBCAR [Lotan 
and Hirschberg (1995) FEES letters 364: 125-128]. 

Plasmid pBCAR was transfected into SOLR strain cells of Escherichia coli 
(Stratagene). Colonies that appeared on chloramphenicol-containing Luria Broth 
(LB) medium [described in Sambrook et al.. Molecular Cloning; A Laboratory 

!5 Manual. Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. 1989], carried 
this plasmid and developed a deep yellow-orange color due to the accumulation of 
j3-carotene. 

As shown in Figure 5, an additional plasmid, designated pZEAX, which 
allows for zeaxanthin synthesis . and ^ accumulation in Escherichia coli was 

20 constructed [this plasmid is described in details in Lotan and Hirschberg (1995) 
FEBS letters 364: 125-128]. SOLR strain Escherichia coli cells were used as a 
host for the pZEAX plasmid. Escherichia coli cells were grown on LB medium 
(see above), at 37°C in the dark on a rotary shaker at 225 rpm. Ampiciliin (50 
g/ml) and/or chloramphenicol (30 jig/ml) (both from Sigma) were added to the 

25 medium for selection of appropriate transformed cells. 

As shown in Figure 6, a plasmid, pHPK, containing the full length cDNA of 
the p-carotene C-4-oxygenase enzyme was identified by color complementation as 
described by Lotan and Hirschberg (1995) FEBS letters 364: 125-128 (see 
description herein below). A L2 kb Pstl-Pstl DNA fragment, containing the 

30 cDNA of the p-C-4-oxygenase from Haematococcus pluvialis^ was isolated from 
plasmid pHPK and inserted into a Pstl site in the coding sequence of the crtZ gene 
in the plasmid pZEAX. This recombinant plasmid was designated pCANTHA and 
is shown in Figure 7. 

The same L2 kb Pstl-Pstl fragment was also inserted into a Pstl site which 

35 exists 600 bp downstream of the crtE gene in the plasmid pZEAX. The resulting 
recombinant plasmid was designated pASTA and is shown in Figure 8. 

The same 1.2 kh Pstl-Pstl fragment was also inserted into a Pstl site which 
exists in the p-lactamase gene in the plasmid pPAN35D5 [Hirschberg J, Ohad N, 



wo 98/18910 



PCT/US97/17819 



31 

Pecker I and Rahat A (1987) Isolation and characterization of herbicide resistant 
mutants in the cyanobacterium Synechococcus R2. Z. Naturforsch 42c: 102-112], 
which carries the psbAl gene from the cyanobacterium Synechococcus PCC7942 in 
the plasmid vector pBR328 [Hirschberg J, Ohad N, Pecker I and Rahat A (1987) 
Isolation and characterization of herbicide resistant mutants in the cyanobacterium 
Synechococcus R2. Z. Naturforsch 42c: 102-112]. Tliis plasmid was designated 
PAN3.5-KETO and is shown in Figure 9. This plasmid was used in the 
transformation of Synechococcus PCC7942 cells following procedures described 
by Golden [Golden SS (1988) Mutagenesis of cyanobacteria by classical and 
gene-transfer-based methods. Methods Enzymol 167: 714-727]. 

Excision of phage library and screening for a P-carotene oxygenase 
gene. Mass excision of the cDNA library, which was prepared as described 
hereinabove, was carried out using the ExAssist helper phage (Stratagene) in cells 
of SOLR strain Escherichia coli that earned the plasmid pBCAR. The e>:ciscd 
library in phagemids form was transfected into Escherichia coli cells strain XLl- 
Blue and the cells were plated on LB plates containing 1 mM isopropylthio-p-D- 
galactosidase (IPTG), 50 fig/ml ampicillin and 30 ^g/ml chloramphenicol, in a 
density that yielded approximately 100-150 colonies per plate. The plates were 
incubated at 37°C overnight and flirther incubated for two more days at room 
temperature. The plates were then kept at 4°C until screened for changes in colony 
colors. 

A plasmid for high expression of crtO in chroi uoplasts . As shown in 
Figures 10-11, a genomic DNA sequence of a tomato species Lycopersicon 
esculentum (nucleotides 1 to 1448 of the Pds gene [as published in Mann V, 
Pecker I and Hirschberg J (1994) cloning and characterization of the gene for 
phytoene desaturase (Pds) from tomato {Lycopersicon esculentum). Plant 
Molecular Biology 24: 429-434], which contains the promoter of the Pds gene 
and the coding sequence for the amino terminus region of the polypeptide PDS 
that serve as a transit peptide for import into chloroplasts and chromoplasts, 
was cloned into a Hindlll-Smal site of the binary plasmid vector pBIB, 
[described by Becker D (1990) Binary vectors which allow the exchange of 
plant selectable markers and reporter genes. Nucleic Acids Research 18:230], 
shown in Figure 10. The recombinant plasmid was designated pPTBIB and is 
shown in Figure 11. 

As shown in Figure 12, a 1,1 10 nucleotide long Eco47llI-Ncol fragment, 
containing the cDNA of crtO from H. pluvialis (nucleotides 211 to 1321 of 
SEQ ID NO:l) was sub-cloned into the Smal site of the plasmid pPTBIB 
(Figure 11) so that the coding nucleotide sequence of the amino terminus of 
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Pds is in the same reading frame as crtO. The recombinant plasmid was- 
designate pPTCRTOBIB. 

Formation of transgenic higher plant. The DNA of pPTCRTOBIB 

was extracted from E. coli cells and was transferred into cells of 

5 Agrobacterium tumefaciens strain EHA105 [described by Hood EE, Gelvin SB, 
Melchers LS and Hoekema A (1993) Transgenic Research 2:208-218] using 
electroporation as described for E, coli [Dower JW, Miller FJ and Ragdsale 
WC (1988) High efficiency transformation of E. coli by high voltage 
electroporation. Nuc. Acids Res. 18: 6127-6145]. Agrobacterium cells were 

10 grown at 28 in LB medium supplemented with 50 luig/ml streptomycin and 
50 i-ig/ml kanamycin as selective agents. Cells of Agrobacterium carrying 
pPTCRTOBIB were harvested from a suspension culture at the stationary phase 
of growth and used for transformation as described by Horsch RB, Fry JE, 
Hoffmann NL, Eicholtz D, Rogers SG and Fraley RT, A simple and general 

15 method for transferring genes into plants. Science (1985) 227:1229-1231; and 
Jeffesrson AR, Kavanagh TA and Bevan WM (1987) GUS fusions: p- 
glucuronidase as a sensitive and versatile gene fusion marker in higher plants. 
The EMBO J. 6: 3901-3907. 

Leaf explants of Nicotiana tobaccum strain NN were infected v/ith the 

20 transformed Agrobacterium celts and. kanamycin-resistant transgenic plants 
were regenerated according to protocols described by Horsch et al. (1985)* and 
Jefferson et al. (1987) cited above. 

With reference now to Figure 13, the presence of the DNA sequence of the 
crtO gene-construct in the fully developed regenerated plants was determined by 

25 DNA Southern blot analysis. To this end DNA was extracted from the leaves 
[according to a protocol described by Kanazawa and Tsutsumi (1992) Extraction 
of restrictable DNA from plants of the genus Nelumbo, Plant Molecular Biology 
Reports 10: 316-318], digested with the endonuclease HincUll, the fragrnents were 
size separated by gel electrophoresis and hybridized with radioactively labeled 

30 crtO sequence (SEQ ID NO: 1). 

It was determined that each transgenic plant that was examined contained at 
least one copy of the crtO DNA sequence, yielding a 1.75 kb band (arrow), 
originating from an internal Hindlll-HindUl fragment of the T-DNA of 
pPTCRTOBIB, additional bands originating from partial digestion, additional 

35 band/s whose sizes vary, depending on the position of insertion in the plant 
genome and a 1 .0 kb band originating from the tobacco plant itself which therefore 
also appears in the negative control WT lane. 
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Sequence analysis. DNA sequence analysis was carried out by the dideoxy 
method [see, Sanger F, Nicklen S & Coulsen AR (1977) DNA sequencing with 
chain termination inhibitors. Proc Natl Acad Sci USA 74: 5463-5467]. 

Carotenoids analysis. Aliquots of Escherichia coli cells which were 
5 grown in liquid in LB medium were centrifuged at 13,000 g for 10 minutes, 
washed once in water and re-centrifuged. After removing the water the cells were 
resuspended in 70 of acetone and incubated at 65°C for 15 minutes. The 
samples were centrifuged again at 13,000 g for 10 minutes and the carotenoid- 
containing supernatant was placed in a clean tube. The carotenoid extract was 
10 blown to dryness under a stream of nitrogen (N2) gas and stored at -lO'^C until 
required for analysis. Carotenoids from plant tissues were extracted by mixing 
0.5-1.0 gr of tissue with 100 |.l1 of acetone followed by incubation at 65°C for 15 
minutes and then treating the samples as described above. 

High-performance liquid chromalography (HPLC) of the carotenoid 
15 extracts was carried out using an acidified reverse-phase CI8 column, Spherisorb 
ODS-2 (silica 5 ^im 4.6 mm x 250 mm) (Phenomenex®). The mobile phase was 
pumped by triphasic Merck-Hitachi L-6200A high pressure pumps at a flow rate of 
1.5 ml/min. The mobile phase consisted of an isocratic solvent system comprised 
of hexane/dichloromethane/isopropyl alcohol/triethylamine (88.5:10:1.5:0.1, v/v). 
20 Peaks were detected at 470 nm using a Waters 996 photodiode-array detector. 
Individual carotenoids were identified by their retention times and their typical 
absorption spectra, as compared to standard samples of chemically pure P- 
carotene, zeaxanthin, echinenone, canthaxanthin, adonirubin and astaxanthin 
(The latter four were kindly provided by Dr. Andrew Young from Liverpool John 
25 Moores University). 

Thin layer chromatography (TLC) was carried out using silica gel 60 F254 
plates (Merck), using ethyl acetate/benzene (7:3, v/v) as an eluent. Visible 
absorption spectra were recorded with a Shimadzu UV-160A spectrophotometer. 
All spectra were recorded in acetone. Spectral fine structure was expressed in 
30 terms of %III/II [Britton, G. (1995). UVA^isible Spectroscopy. In: Carotenoids; 
Vol IB, Spectroscopy. Eds. Britton G, Liaaen-Jensen S and Pfander H. 
Birkhauser Verlag, Basel, pp. 13-62]. 

Isolation and identification of the carotenoids extracted from ceils of £. coli 
are treated in order of increasing adsorption (decreasing R/-values) on silica TLC 
35 plates, Carotenoids structure and the biosynthesis pathway of astaxanthin are 
given in Figure 14. The following details refer to the carotenoids numbered 1 
through 9 in Figure 14. 
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P-Carotene (1). R^O.92 inseparable from authentic (1). R/ .VIS A-max 
(428), 452, 457, %m/II = 0. 

Echinenone (2). R/'0,90 inseparable from authentic (2). Kf .VIS A-niax 
455, %III/II = 0. 

5 Canthaxanthin (3). Rf 0.2^7. inseparable from authentic (3). R/ .VIS A^niax 

nm: 470, %III/II = 0. 

p-Cryptoxanthin (4). R^ .VIS X^ax nm: (428), 451, 479, %I1I/II = 

0. 

Adonirubin (5). R/'0.82 inseparable from authentic (5). R/ .VIS A^j^iax 
10 476, %III/II = 0. 

Astaxanthin (6). RfOJ9 inseparable from authentic (6). Rf .VIS Xmax 
nm: 477, %m/II-0. 

Adonixanthin (7). R/^0.72. R/ .VIS X^ax nm: 464, %III/II = 0. 
Zeaxanthin (8): Ry^O.65 inseparable from authentic (8). .VIS /^nax ^m: 
15 (428), 451, 483, %III/II = 27. 

Hydroxyechinenone (9). R/0.80, R?, 3.0. VIS T^rnax nm: 464, %III/II = 0. 
Chirality configuration. Chirality configuration of astaxanthin was 
determined by HPLC of the derived diastereoisomeric camphanates of the 
astaxanthin [Renstrom B, Borch G, Skulberg M and Liaaen- Jensen S (1981) 
20 Optical purity of (35'35")"astaxanthin from Haematococcus phivialis. Phytochem 
20: 2561-2565]. The analysis proved that the Escherichia coli cells synthesize 
pure (36^,3 'S) astaxanthin. 

EXAMPLE 1 

25 Cloning the ^-C^4''Oxygenase gene 

A cDNA library was constructed in Lambda ZAP II vector from poly-An 
RNA of Haematococcus pluvialis cells that had been induced to synthesize 
astaxanthin by nitrogen deprivation as described hereinabove. The entire library 

30 was excised into p-carotene-accumulating cells of Escherichia coli, strain SOLR, 
which carried plasmid pBCAR (shown in Figure 4). Screening for a p-carotene 
oxygenase gene was based on color visualization of colonies of size of 3 mm in 
diameter. Astaxanthin and other oxygenated forms of P-carotene (i.e., 
xanthophylls) have distinct darker colors and thus can be detected from the yellow 

35 p-carotene background. The screening included approximately 100,000 colonies 
which were grown on LB medium plates containing ampicillin and 
chloramphenicol that selected for both the Lambda ZAP II vector in its plasmid 
propagating fomi and the pBCAR plasmid. Several colonies showed different 
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color tones but only one exhibited a conspicuous brown-red pigment. This colony 
presumed to contain a xanthophyll biosynthesis gene was selected for further 
analysis described hereinbelow in the following Examples. 

EXAMPLE 2 

Analysis of the ^-C'4-oxygenase activity in Escherichia coli 

The red-brown colony presumed to contain a xanthophyll biosynthesis gene 
(see Example 1 above) was streaked and further analyzed. First, the recombinant 
ZAP II plasmid carrying the cDNA clone that was responsible for xanthophyll 
synthesis in Escherichia coli was isolated by preparing plasmid DNA from the red- 
brown colony, transfecting it to Escherichia coli cells of the strain XL 1 -Blue and 
selection on ampicillin-containing medium. This plasmid, designated pHPK 
(pHPK is a Lambda ZAP II vector containing an insert isolated from the red-brown 
colony), was used to transform p-carotene-producing Escherichia coli cells 
{Escherichia coli SOLR strain that carry the plasmid pBCAR shown in Figure 4) 
resulting in the formation of red-brown colonies. Carotenoids from this 
transformant, as well as from the host cells (as control) were extracted by acetone 
and analyzed by HPLC. 

HPLC analysis of carotenoids of the host bacteria which synthesized (3- 
carotene {Escherichia coli SOLR strain that carry the plasmid pBCAR shown in 
Figure 4), as compared with a brown-red colony, revealed that only traces of (3- 
carotene were obsers^ed in the transfonucint cells while a new major peak of 
canthaxanthin and another minor peak of echinenone appeared [described in detail 
by Lotan and Hirschberg (1995) FEBS letters 364: 125-128]. These results 
indicate that the cDNA in plasmid pHPK, designated crtO encodes an enzyme with 
p_C-4-oxygenase activity, which converts (5-carotene to canthaxanthin via 
echinenone (see Figure 14). It is, therefore concluded that a single enzyme 
catalyzes this two-step ketonization conversion by acting symmetrically on the 4 
and 4' carbons of the (3- and p'-rings of p-carotene, respectively. 

EXAMPLE 3 
Production of astaxanthin in Escherichia coli cells 

To determine whether P-carotene hydroxylase (e.g., a product of the crtZ 
gene of Erwinia herbicola) can convert thus produced canthaxanthin to astaxanthin 
and/or whether zeaxanthin converted from p-carotene by P-carotene hydroxylase 
can be converted by p-C-4-oxygenase to astaxanthin, the crtO cDNA of 
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Haematococcus plwialis thus isolated, was expressed in Escherichia coli cells 
together with the crtZ gene of Erwinia herbicola. For this purpose, Escherichia 
coli cells of strain SOLR were transfected with either plasmid pASTA alone 
containing, as shown in Figure 8, both crtZ and crtO or, alternatively with both 
plasmids, pHPK containing, as shown in Figure 6, crtO, and pZEAX containing, as 
shown in Figure 5, crtZ. Carotenoids in the resulting transformed cells were 
extracted and analyzed by HPLC as described above. The results, given in Table 
1, show the composition of carotenoids extracted from the cells containing the 
plasmid pASTA. Similar carotenoid composition is found in Escherichia coli cells 
which carry both pHPK and pZEAX. 

TABLE 1 

Carotenoid % of total carotenoid composition 

8.0 
1.7 
4.2 
4.2 
57.8 
1.0 
17.9 
5.2 



P-Carotene 

Echineone 

P -Cryptoxanthin 

Canthaxanthin 

Zeaxanthin 

Adonirubin 

Adonixanthin 

Astaxanthin 



The results presented in Table 1, prove that carotenoids possessing either a 
p-end group or a 4-keto-p-end group act as substrates for the hydroxylation 
reactions catalyzed by crtZ gene product at carbons C-3 and C-3*. The 
hydroxylation of p-carotene and canthaxanthin results in the production of 
zeaxanthin and astaxanthin, respectively. These hydroxylations result in the 
production of astaxanthin and the intermediate ketocarotenoids, 3- 
hydroxyechinenone, adonixanthin and adonimbin. These results further 
demonstrate that astaxanthin can be produced in heterologous cells by expressing 
the gene crtO together with a gene that codes for a p-carotene hydroxylase, 

EXAMPLE 4 

Sequence analysis of the gene for ^-carotene C'4'Oxygenase 

The full length, as was determined by the presence of a poly A tail, of the 
cDNA insert in plasmid pHPK (1771 base pairs) was subjected to nucleotide 
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sequence analysis. This sequence, set forth in SEQ ID NO:l, and its translation to 
an amino acid sequence set forth in SEQ ID NO: 3 (329 amino acids), were 
deposited in EMBL database on May 1, 1995, and obtained the EMBL accession 
numbers X86782 and X86783, respectively. 
5 An open reading frame (ORF) of 825 nucleotides (nucleotides 166 through 

1152 in SEQ ID NO:3) was identified in this sequence. This ORF codes for the 
enzyme P-carotene C-4-oxygenase having 329 amino acids set forth in SEQ ID 
NO:4, as proven by its functional expression in Escherichia coli cells (see 
Example 3 above). The gene for this enzyme was designated crtO, 

10 

EXAMPLE 5 
Transformation of cyanobacteria y\>ith crtO 

The plasmid DNA of pPAN3.5-ICETO, shown in Figure 9, was transfected 

15 into cells of the cyanobacterium Synechococcus PCC7942 according to the method 
described by Golden [Golden SS (1988) Mutagenesis of cyanobacteria by classical 
and gene-transfer-based methods. Methods Enzymol 167: 714-727]. The 
cyanobacterial cells were plated on BGll medium-containing petri dishes that 
contained also chloramphenicol. Colonies of chloramphenicol-resistant 

20 Synechococcus PCC7942 which appeared after ten days were analyzed for their 
carotenoid content. As detailed in Table 2 below, HPLC analysis of these cells 
revealed that the major carotenoid components of the cells was p-carotene, 
echinenone, canthaxanthin, adonirubin and astaxanthin. A similar analysis of the 
v/ild type strain and of Synechococcus PCC7942 transfected with a plasmid in 

25 which the orientation of the crtO gene is reversed (not shown), which is therefore 
not capable of producing an active protein, did not revealed production of 
echinenone, canthaxanthin, adonirubin and astaxanthin. 

These result prove that crtO of Haematococcus pluvialis can be expressed 
in cyanobacteria and that its expression provided a p-C-4-oxygenase enzymatic 

30 activity needed for the conversion of P-carotene to canthaxanthin. This result 
further demonstrates that the endogenous P-carotene hydroxylase of 
Synechococcus PCC7942 is able to convert thus produced canthaxanthin to 
astaxanthin. Since the carotenoid biosynthesis pathway is similar in all green 
photosynthetic organism [see Figures 1 and 10 and, Pecker I, Chamovitz D, Linden 

35 H, Sandmann G and Hirschberg J (1992) A single polypeptide catalyzing the 
conversion of phytoene to ^-carotene is transcriptionally regulated during tomato 
fruit ripening. Proc Natl Acad Sci USA 89: 4962-4966] it is deduced that 
astaxanthin can be produced in algae, and higher plants by expressing crtO in any 
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tissue that express also the endogenous p-carotene hydroxylase. It is further 
deduced that astaxanthin can be produced by any organism provided it contains 
either endogenous or engineered P-carotene biosynthesis pathway, by expressing 
crtO in any tissue that express either endogenous or genetically engineered P- 
carotene hydroxylase. 

TABLE 2 



Carotenoid % of total carotenoid composition 

p-Carotene 3L5 

Echinenone 18.5 

Canthaxanthin 16.1 

Zeaxanthin 22.3 

Adonirubin , 6.0 

Astaxanthin 5.6 



EXAMPLE 6 
Determining the chirality configuration of astaxanthin 
produced in heterologous systems 

The chirality configurations of astaxandiin produced by Escherichia coli 
cells, as described under Example 3 hereinabove, and by cyanobacterium 
Synechococcus PCC7942 cells, as described in Example 5 hereinabove, were 
determined by HPLC of the derived diastereoisomeric camphanates of the 
astaxanthin [Renstrom B, Borch G, Skulberg M and Liaaen-Jensen S (1981) 
Optical purity of (3*S',3S')-astaxanthin from Haematococcus pluvialis. Phytochem 
20: 2561-2565]. The analysis proved that the Escherichia coli and Synechococcus 
PCC7942 cells described above, synthesize pure (3iS,3'S) astaxanthin. 

EXAMPLE 7 
Transformation of a higher plant with crtO 

Producing natural astaxanthin in higher plants has two anticipated 
benefits. First, as a pure chemical, astaxanthin is widely used as feed additive 
for fish. It is a potential food colorant suitable for humans consumption and 
has potential applications in the cosmetic industry. Second, inducing 
astaxanthin biosynthesis in vivo in flowers and fruits will provide attractive 
pink/red colors which will increase their appearance and/or nutritious worth. 
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In flowers and fruits carotenoids are normally synthesized and 
accumulated to high concentration in chromoplasts, a typical pigment- 
containing plastids, thus providing typical intense colors to these organs. 
Inducing synthesis of astaxanthin in chromoplasts enables the accumulation of 
high concentration of this ketocarotenoid. Over-expression of carotenoid 
biosynthesis genes which results in elevated concentrations of carotenoids in 
chloroplasts, or other alterations in carotenoid composition in chloroplasts may 
damage the thylakoid membranes, impair photosynthesis and thus is deleterious 
to the plants. In contrast, increase of carotenoid concentration or alteration in 
carotenoid composition in chromoplasts do not affect the viability of the plant 
nor the yield of fruits and flowers. 

Thus, gene-transfer technology was used to implant the crtO gene 
isolated from the alga Haematococciis phivialis, as described, into a higher 
plant, in such a way that its expression is up-regulated especially in 
chromoplast-containing cells. 

To this end, a T-DNA containing binary plasmid vector as shown in 
Figure 12 was assembled in E. coli from the promoter and coding DNA 
sequences of the transit peptide encoded by the Pds gene from a tomato species 
Lycopersicon esculentum, linked to the coding DNA sequence of crtO from H. 
pluvialis. Upon stable transfer of this DNA construct via Agrobocterium- 
mediated transformation into a tobacco {Nicotiana tabacum NN) plant to form a 
transgenic plant, as described under methods above, the plant acquired the 
ability to produce ketocarotenoids especially in flovv^er tissues (chromoplast- 
containing cells). It should be noted that the Pds gene promoter is capable of 
directing transcription and therefore expression especially in chloroplasts 
and/or chromoplasts-containing tissues of plants. It should be further noted that 
the transit peptide encoded by part of the Pds coding sequence is capable of 
directing conjugated (i.e., in frame) proteins into plant chromoplasts and/or 
chloroplasts. 

As shown in Figure 15, in chromoplasts-containing cells, such as in the 
nectary tissue of the flower of tobacco, this DNA construct induces 
accumulation of astaxanthin and other ketocarotenoids to a higher level which 
alters the color from the normal yellow to red. 

Concentration and composition of carotenoids in chloroplasts-containing 
tissues, such as leaves, and in chromoplast-containing tissues, such as flowers, 
were determined in the transgenic plants and compared to normal non- 
transformed plants. 
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Carotenoids compositions in leaves (chloroplasts-containing tissue) and 
in the nectary tissue of flowers (chromoplast containing tissue) of wild type and 
transgenic tobacco plants were determined by thin layer chromatography (TLC) 
and by high pressure liquid chromatography (HPLC) as described above. 

Total carotenoids concentration in leaves (chloroplasts-containing 
tissue) and in the nectary tissue of flowers (chromoplast containing tissue) of 
wild type and transgenic tobacco plants are summarized in Tables 3 below. 

Percents of carotenoids composition in leaves of wild-type and 
transgenic tobacco plants are summarized in Tables 4 below. 

Percents of carotenoids composition in the nectary tissue of flowers of 
wild-type and transgenic tobacco plants are summarized in Tables 5 below. 



TABLE 3 
fag carotenoids per gr fresh weight 

Wild-type Transgenic with crtO 

Leaf 

(Chloroplasts) 200 240 

Nectary tissue 

(Chromoplasts) 280 360 



TABLE 4 

% of total carotenoids composition in chloroplasts-containing tissue (leaf) 



Wild-type Transgenic 

p-carotene 29.9 26.7 

neoxanthin 5.0 5.9 

violaxanthin 11.6 18.1 

antheraxanthin 4.9 2.6 

lutein 43.9 41.4 

zeaxanthin 4.7 4.3 

astaxanthin + adonirubin 0.0 1.0 
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TABLE 5 

% of total carotenoid composition in chromoplasts-containing tissue (flower) 





Wild-type 


Transgenic 


beta-carotene 


58.1 


21.0 


violaxanthin 


40.3 


1.5 


lutein 


0.0 


I.l 


zeaxanthin 


1.6 


1.0 


hydroxy echinenone 


0.0 


13.7 


3 'hydroxy echinenone 


0.0 


4.1 


adonirubin 


0.0 


22.4 


adonixanthin 


0.0 


8.7 


astaxanthin 


0.0 


26.5 



15 

Please note the elevated content of hydroxyechinenone, 
3 'hydroxy echinenone, adonirubin, adonixanthin and astaxanthin especially in 
the chromoplast containing tissue of the transgenic tobacco plants. 

20 Thus, the present invention successfully addresses the shortcomings of the 

presently known configurations by enabling a relatively low cost biotechnological 
production of (3iS',3*iS^ astaxanthin by providing a peptide having a P-C-4- 
oxygenase activity; a DNA segment coding for this peptide; an RNA segments 
coding for this peptide; a recombinant DNA molecule comprising a vector and the 

25 DNA segment; a host containing the above described recombinant DNA molecule 
or DNA segment; and of a method for biotechnologically producing (3*5,3 'aS^ 
astaxanthin or a food additive containing {3S,3'S) astaxanthin, using the host. 



30 



While the invention has been described with respect to a limited number of 
embodiments, it will be appreciated that many variations, modifications and other 
applications of the invention may be made. 



wo 98/18910 



PCTAJS97/17819 



42 

SEQUENCE LISTING 



(1) 



GENERAL INFORMATION: 



Ci) 


APPLICANTS: 


Joseph Hirschberg, Tamar Lotan and 
Mark Harker 


(ii) 


TITLE OF INVENTION: 


Polynucleotide molecule from 
Haematococcus pluvial is encoding a 
polypeptide having a p-C- 4 -oxygenase 

activity for biotechnological product i 
(3S,3'S) astaxanthfn. 


< i i i ) 


NUMBER OF SEQUENCES: 


A 


(iv) 


CORRESPONDENCE ADDRESS: 






(A) ADDRESSEE: 


Mark M, Friedman c/o Robert Sheinbein 




(B) STREET: 


29A0 Birchtree space lane 




(C) CITY: 


Si Iver Spring 




<D) STATE: 


Maryland 




(E) COUNTRY: 


United States of America 




(F) ZIP: 


20906 


(V) 


COMPUTER READABLE FORM: 






(A) MEDIUM TYPE: 


1.44 megabyte, 3.5" microdisk 




<B) COMPUTER: 


Twinhead SI imnote-890TX 




(C) OPERATING SYSTEM 


: MS DOS version 6.2, 
Windows version 3.11 




(D) SOFTWARE: 


Word for Windows version 2.0 


(vi ) 


CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 




(B) FILING DATE: 






<C) CLASSIFICATION: 




(vii) 


PRIOR APPLICATION DATA: 






(A> APPLICATION NUMBER: 




<B) FILING DATE: 




(viii) 


ATTORNEY/AGENT INFORMATION: 




(A) NAME: 


Friedmam, Mark M. 




(B) REGISTRATION NUMBER: 33,883 




(C) REFHKENCE/DOCKET 


NUMBER: 325/5 


(IX) 


TELECOMMUNICATION INFORMATION: 




(A) TELEPHONE: 


972-3-5625553 




(B) TELEFAX: 


972-3-5625554 




CO TELEX: 





(2) 



INFORMATION FOR SEQ ID N0:1: 
(i) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 
(B) TYPE: 
<C) STRANDEDNESS: 
(D) TOPOLOGY: 
(xi> SEQUENCE DESCRIPTION: 



1771 base pairs 
nucleic acid 
double 
linear 

SEQ' ID N0:1: 



6GC 


ACG 


AGC 


TTG 


CAC 


GCA AGT 


CAG 


CGC 


GCG 


CAA 


GTC AAC 


ACC 


TGC CGG 


48 


TCC 


ACA 


GCC 


TCA 


AAT 


AAT 


AAA 


GAG 


CTC 


AAG CGT 


TTG TGC 


GCC 


TCG ACG 


96 


TGG 


CCA 


GTC 


TGC 


ACT 


GCC 


TTG 


AAC 


CCG 


CGA 


GTC 


TCC CGC 


CGC 


ACT GAC 


144 


TGC 


CAT 


AGC 


ACA 


GCT 


AGA 


CGA 


ATG 


CAG 


CTA 


GCA 


GCG ACA 


GTA 


ATG TTG 


192 


GAG 


CAG 


CTT 


ACC G6A AGC GCT GAG GCA CTC AAG GAG AAG GAG AAG GAG 


240 


GTT 


GCA 


GGC 


AGC 


TCT 


GAC 


GTG 


TTG 


CGT 


ACA 


TGG 


GCG ACC 


CAG 


TAC TCG 


288 


CTT 


CCG 


TCA 


GAA 


GAG 


TCA 


GAC 


GCG 


GCC 


CGC 


CCG 


GGA CTG 


AAG 


AAT GCC 


336 


TAC 


AAG 


CCA 


CCA 


CCT 


TCC 


GAC 


ACA 


AAG 


GGC 


ATC 


ACA ATG GCG 


CTA CGT 


384 


GTC 


ATC 


GGC 


TCC 


TGG 


GCC 


GCA 


GTG 


TTC 


CTC 


CAC 


GCC ATT 


TTT 


CAA ATC 


432 


AAG 


CTT 


CCG 


ACC 


TCC 


TTG 


GAC 


CAG 


CTG 


CAC 


TGG 


CTG CCC 


GTG 


TCA GAT 


480 


GCC 


ACA 


GCT 


CAG 


CT6 


GTT 


AGC 


GGC 


ACG 


AGC 


AGC 


CTG CTC 


GAC 


ATC GTC 


528 
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GTA 


GTA 


TTC 


TTT 


GTC 


CTG 


GAG 


TTC 


CTG 


TAC 


ACA 


GGC 


CTT 


TTT 


ATC 


ACC 


576 


ACG 


CAT 


GAT 


GCT 


ATG 


CAT 


GGC 


ACC 


ATC 


GCC 


ATG 


AGA 


AAC 


AGG 


CAG 


CTT 


62A 


AAT 


GAG 


TTC 


TTG 


GGC 


AGA 


GTA 


TGC 


ATC 


TCC 


TTG 


TAC 


GCC 


TGG 


TTT 


GAT 


672 


TAG 


AAC 


ATG 


CTG 


CAC 


CGC 


AAG 


CAT 


TGG 


GAG 


CAC 


CAC 


AAC 


CAC 


ACT 


GGC 


720 


GAG 


GTG 


GGC 


AAG 


GAC 


CCT 


GAC 


TTC 


CAC 


AGG 


GGA 


AAC 


CCT 


GGC 


ATT 


GTG 


768 


ccc 


TGG 


TTT 


GCC 


AGC 


TTC 


ATG 


TCC 


AGC 


TAC 


ATG 


TCG 


ATG 


TGG 


CAG 


TTT 


816^ 


GCG 


CGC 


CTC 


GCA 


TGG 


TGG 


ACG 


GTG 


GTC 


ATG 


CAG 


CTG 


CTG 


GGT 


GCG 


CCA 


864 


ATG 


GCG 


AAC 


CTG 


CTG 


GTG 


TTC 


ATG 


GCG 


GCC 


GCG 


CCC 


ATC 


CTG 


TCC 


GCC 


912 


TTC 


CGC 


TTG 


TTC 


TAC 


TTT 


GGC 


ACG 


TAC 


ATG 


CCC 


CAC 


AAG 


CCT 


GAG 


CCT 


960 


GGC 


GCC 


GCG 


TCA 


GGC 


TCT 


TCA 


CCA 


GCC 


GTC 


ATG 


AAC 


TGG 


TGG 


AAG 


TCG 


1008 


CGC 


ACT 


AGO 


CAG 


GCG 


TGC 


GAC 


CTG 


GTC 


AGC 


TTT 


CTG 


ACC 


TGC 


TAC 


CAC 


1056 


TTC 


GAC 


CTG 


CAC 


TGG 


GAG 


CAC 


CAC 


CGC 


TGG 


CCC 


TTC 


GCC 


CCC 


TGG 


TGG 


1104 


GAG 


CTG 


CCC 


AAC 


TGC 


CGC 


CGC 


CTG 


TCT 


GGC 


CGA 


GGT 


CTG 


GTT 


CCT 


GCC 


1152 


TAG 


CTG 


GAC 


ACA 


CTG 


CAG 


TGG 


GCC 


CTG 


CTG 


CCA 


GCT 


GGG 


CAT 


GCA 


GGT 


1200 


TGT 


GGC 


AGG 


ACT 


GGG 


TGA 


GGT 


GAA 


AAG 


CTG 


CAG 


GCG 


CTG 


CTG 


CCG 


GAC 


1248 


ACG 


CTG 


CAT 


GGG 


CTA 


CCC 


TGT 


GTA 


GCT 


GCC 


GCC 


ACT 


AGG 


GGA 


GGG 


GGT 


1296 


TTG 


TAG 


CTG 


TCG 


AGC 


TTG 


CCC 


CAT 


GGA 


TGA 


AGC 


TGT 


GTA 


GTG 


GTG 


CAG 


1344 


GGA 


GTA 


CAC 


CCA 


CAG 


GCC 


AAC 


ACC 


1:tt 


GCA 


GGA 


GAT 


GTC 


TTG 


CGT 


CGG 


1392 


GAG 


GAG 


TGT 


TGG 


GCA 


GTG 


TAG 


ATG 


CTA 


TGA 


TTG 


TAT 


CTT 


AAT 


GCT 


GAA 


1440 


GCC 


TTT 


AGG 


GGA 


GCG 


ACA 


CTT 


AGT 


GCT 


GGG 


CAG 


GCA 


ACG 


CCC 


TGC 


AAG 


1488 


GTG 


CAG 


GCA 


CAA 


GCT 


AGG 


CTG 


GAC 


GAG 


GAC 


TCG 


GTG 


GCA 


GGC 


AGG 


TGA 


1536 


AGA 


GGT 


GCG 


GGA 


GGG 


TGG 


TGC 


CAC 


ACC 


CAC 


TGG 


GCA 


AGA 


CCA 


TGC 


TGC 


1584 


AAT 


GCT 


GGC 


GGT 


GTG 


GCA 


GTG 


AGA 


GCT 


GCG 


TGA 


TTA 


ACT 


GGG 


CTA 


TGG 


1632 


ATT 


GTT 


TGA 


GCA 


GTC 


TCA 


CTT 


ATT 


CTT 


TGA 


TAT 


AGA 


TAC 


TGG 


TCA 


GGC 


1680 


AGG 


TCA 


GGA 


GAG 


TGA 


GTA 


TGA 


ACA AGT 


TGA 


GAG 


GTG 


GTG 


CGC 


TGC 


CCC 


1728 


TGC 


GCT 


TAT 


GAA 


GCT 


GTA 


ACA 


ATA 


AAG 


TGG 


TTC 












1771 



(2) INFORMATION FOR SEQ ID N0:2: 



SEQUENCE 


CHARACTERISTICS: 




(A) 


LENGTH: 


1771 bases 


(B) 


TYPE: 


nucleic acid 


(C) 


STRANDEDNESS: 


single 


(D) 


TOPOLOGY : 


I inear 


SEQUENCE 


DESCRIPTION': 


SEQ ID N0:2: 



GGC ACG AGC UUG CAC GCA AGU CAG CGC 
UCC ACA GCC UCA AAU AAU AAA GAG CUC 
UGG CCA GUC UGC ACU GCC UUG AAC CCG 
UGC CAU AGC ACA GCU AGA CGA AUG CAG 
GAG CAG CUU ACC GGA AGC GCU GAG GCA 
GUU GCA GGC AGC UCU GAC GUG UUG CGU 
CUU CCG UCA GAA GAG UCA GAC GCG GCC 
UAC AAG CCA CCA CCU UCC GAC ACA AAG 
GUC AUC GGC UCC UGG GCC GCA GUG UUC 
AAG CUU CCG ACC UCC UUG GAC CAG CUG 
GCC ACA GCU CAG CUG GUU AGC GGC ACG 
GUA GUA UUC UUU GUC CUG GAG UUC CUG 
ACG CAU GAU GCU AUG CAU GGC ACC AUC 
AAU GAC UUC UUG GGC AGA GUA UGC AUC 
UAC AAC AUG CUG CAC CGC AAG CAU UGG 
GAG GUG GGC AAG GAC CCU GAC UUC CAC 
CCC UGG UUU GCC AGC UUC AUG UCC AGC 
GCG CGC CUC GCA UGG UGG ACG GUG GUC 
AUG GCG AAC CUG CUG GUG UUC AUG GCG 
UUC CGC UUG UUC UAC UUU GGC ACG UAC 
GGC GCC GCG UCA GGC UCU UCA CCA GCC 
CGC ACU AGC CAG GCG UCC GAC CUG GUC 



GCG 


CAA 


GUC 


AAC 


ACC 


UGC 


CGG 


48 


AAG 


CGU 


UUG 


UGC 


GCC 


UCG 


ACG 


96 


CGA 


GUC 


UCC 


CGC 


CGC 


ACU 


GAC 


144 


CUA 


GCA 


GCG 


ACA 


GUA 


AUG 


UUG 


192 


CUC 


AAG 


GAG 


AAG 


GAG 


AAG 


GAG 


240 


ACA 


UGG 


GCG 


ACC 


CAG 


UAC 


UCG 


288 


CGC 


CCG 


GGA 


CUG 


AAG 


AAU 


GCC 


336 


GGC 


AUC 


ACA 


AUG 


GCG 


CUA 


CGU 


384 


CUC 


CAC 


GCC 


AUU 


UUU 


CAA 


AUC 


432 


CAC 


UGG 


CUG 


CCC 


GUG 


UCA 


GAU 


480 


AGC 


AGC 


CUG 


CUC 


GAC 


AUC 


GUC 


528 


UAC 


ACA 


GGC 


CUU 


UUU 


AUC 


ACC 


576 


GCC AUG 


AGA 


AAC 


AGG 


CAG 


CUU 


624 


UCC 


UUG 


UAC 


GCC 


UGG 


UUU 


GAU 


672 


GAG 


CAC 


CAC 


AAC 


CAC 


ACU 


GGC 


720 


AGG 


GGA AAC 


CCU 


GGC 


AUU 


GUG 


768 


UAC 


AUG 


UCG 


AUG 


UGG 


CAG 


UUU 


816 


AUG 


CAG 


CUG 


CUG 


GGU 


GCG 


CCA 


864 


GCC 


GCG 


CCC 


AUC 


CUG 


UCC 


GCC 


912 


AUG 


CCC 


CAC 


AAG 


CCU 


GAG 


CCU 


960 


GUC 


AUG 


AAC 


UGG 


UGG 


AAG 


UCG 


1008 


AGC 


UUU 


CUG 


ACC 


UGC UAC 


CAC 


1056 
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UUC GAC CUG CAC UGG GAG CAC CAC CGC U66 CCC UUC GCC CCC UGG UGG 1104 

GAG CUG CCC AAC UGC CGC CGC CUG UCU GGC CGA GGU CUG GUU CCU GCC 1152 

UAG CUG GAC ACA CUG CAG UGG GCC CUG CUG CCA GCU GGG CAU GCA GGU 1200 

UGU GGC AGG ACU GGG UGA GGU GAA AAG CUG CAG GCG CUG CUG CCG GAC 1248 

ACG CUG CAU GGG CUA CCC UGU GUA GCU GCC GCC ACU AGG GGA GGG GGU 1296 

UUG UAG CUG UCG AGC UUG CCC CAU GGA UGA AGC UGU GUA GUG GUG CAG 1344 

GGA GUA CAC CCA CAG GCC AAC ACC CUU GCA GGA GAU GUC UUG CGU CGG 1392 

GAG GAG UGU UGG GCA GUG UAG AUG CUA UGA UUG UAU CUU AAU GCU GAA 1440 

GCC UUU AGG GGA GCG ACA CUU AGU GCU GGG CAG GCA ACG CCC UGC AAG 1488 

GUG CAG GCA CAA GCU AGG CUG GAC GAG GAC UCG GUG GCA GGC AGG UGA 1536 

AGA GGU GCG GGA GGG UGG UGC CAC ACC CAC UGG GCA AGA CCA UGC UGC 15&4 

AAU GCU GGC GGU GUG GCA GUG AGA GCU GCG UGA UUA ACU GGG CUA UGG 1632 

AUU GUU UGA GCA GUC UCA CUU AUU CUU UGA UAU AGA UAC UGG UCA GGC 1680 

AGG UCA GGA GAG UGA GUA UGA ACA AGU UGA GAG GUG GUG CGC UGC CCC 1728 

UGC GCU UAU GAA GCU GUA ACA AUA AAG UGG UUC 1771 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1771 base pairs 

(B) TYPE: nucleic acid 
CC) STRANDEDNESS: . double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

GGC ACG AGC TTG CAC GCA AGT CAG CGC GCG CAA GTC AAC ACC TGC CGG 48 
TCC ACA GCC TCA AAT AAT AAA GAG CTC AAG CGT TTG TGC GCC TCG ACG 96 
TGG CCA GTC TGC ACT GCC TTG AAC CCG CGA GTC TCC CGC CGC ACT GAC 144 
TGC CAT AGC ACA GCT AGA CGA ATG CAG CTA GCA GCG ACA GTA ATG TTG 192 

Met Gin Leu Ala Ala Thr Val Met Leu 
5 

GAG CAG CTT ACC GGA AGC GCT GAG GCA CTC AAG GAG AAG GAG AAG GAG Z^^Q 
Glu Gin Leu Thr Gly Ser Ala Glu Ala Leu Lys Glu Lys Glu Lys Glu 
10 15 20 25 

GTT GCA GGC AGC TCT GAC 6TG TTG CGT ACA TGG GCG ACC CAG TAC TCG 288 
Val Ala Gly Ser Ser Asp Val Leu Arg Thr Trp Ala Thr Gin Tyr Ser 

30 35 40 

CTT CCG TCA GAA GAG TCA GAC GCG GCC CGC CCG GGA CTG AAG AAT GCC 336 
Leu Pro Ser Glu Glu Ser Asp Ala Ala Arg Pro Gly Leu Lys Asn Ala 

45 50 55 

TAC AAG CCA CCA CCT TCC GAC ACA AAG GGC ATC ACA ATG GCG CTA CGT 384 
Tyr Lys Pro Pro Pro Ser Asp Thr Lys Gly lie Thr Met Ala Leu Arg 

60 65 70 

GTC ATC GGC TCC TGG GCC GCA GTG TTC CTC CAC GCC ATT TTT CAA ATC 432 
Val lie Gly Ser Trp Ala Ala Val Phe Ueu His Ala He Phe Gin lie 

75 80 85 

AAG CTT CCG ACC TCC TTG GAC CAG CTG CAC TGG CTG CCC GTG TCA GAT 480 
Lys Leu Pro Thr Ser Leu Asp Gin Leu His Trp Leu Pro Val Ser Asp 
90 95 100 105 

GCC ACA GCT CAG CTG GTT AGC GGC ACG AGC AGC CTG CTC GAC ATC GTC 528 
Ala Thr Ala Gin Leu Val Ser Gly Thr Ser Ser Leu Leu Asp lie Val 

110 115 120 

GTA GTA TTC TTT GTC CTG GAG TTC CTG TAC ACA GGC CTT TTT ATC ACC 576 
Val Val Phe Phe Val Leu Glu Phe Leu Tyr Thr Gly Leu Phe He Thr 

125 130 135 

ACG CAT GAT GCT ATG CAT GGC ACC ATC GCC ATG AGA AAC AGG CAG CTT 624 
Thr His Asp Ala Met His Gly Thr lie Ala Met Arg Asn Arg Gin Leu 
140 145 150 
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AAT GAC TTC TTG GGC AGA GTA TGC ATC TCC TTG TAC GCC TGG TTT GAT 672 
Asn Asp Phe Leu Gly Arg Val Cys lie Ser Leu Tyr Ala Trp Phe Asp 

155 160 165 

TAC AAC ATG CTG CAC CGC AAG CAT TGG GAG CAC CAC AAC CAC ACT GGC 720 
Tyr Asn Met Leu His Arg Lys His Trp GLu His His Asn His Thr Gly 
170 175 180 185 

GAG GTG GGC AAG GAC CCT GAC TTC CAC AGG GGA AAC CCT GGC ATT GTG 768 
Glu Val Gly Lys Asp Pro Asp Phe His Arg Gly Asn Pro Gly He Val 

190 195 200 

CCC TGG TTT GCC AGC TTC ATG TCC AGC TAC ATG TCG ATG TGG CAG TTT 816 
Pro Trp Phe Ala Ser Phe Met Ser Ser Tyr Met Ser Met Trp Gin Phe 

205 210 215 

GCG CGC CTC GCA TGG TGG ACQ GTG GTC ATG CAG CTG CTG GGT GCG CCA 864 
Ala Arg Leu Ala Trp Trp Thr Val Val Met Gin Leu Leu Gly Ala Pro 

220 225 230 

ATG GCG AAC CTG CTG GTG TTC ATG GCG GCC GCG CCC ATC CTG TCC GCC 912 
Met Ala Asn Leu Leu Val Phe Met Ala Ala Ala Pro He Leu Ser Ala 

235 240 245 

TTC CGC TTG TTC TAC TTT GGC ACG TAC ATG CCC CAC AAG CCT GAG CCT 960 
Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Met Pro His Lys Pro Glu Pro 
250 255 260 265 

GGC GCC GCG TCA GGC TCT TCA CCA GCC GTC ATG AAC TGG TGG AAG TCG 1008 
Gly Ala Ala Ser Gly Ser Ser Pro Ala Val Met Asn Trp Trp Lys Ser 

270 275 280 

CGC ACT AGC CAG GCG TCC GAC CTG GTC AGC TTT CTG ACC TGC TAC CAC 1056 
Arg Thr Ser Gin Ala Ser Asp Leu Val Ser Phe Leu Thr Cys Tyr His 

285 290 295 

TTC GAC CTG CAC TGG GAG CAC CAC CGC TGG CCC TTC GCC CCC TGG TGG 1104 
Phe Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala Pro Trp Trp 

300 305 310 

GAG CTG CCC AAC TGC CGC CGC CTG TCT GGC CGA GGT CTG GTT CCT GCC 1152 
Glu Leu Pro Asn Cys Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala 

315 320 325 

TAG CTG GAC ACA CTG CAG TGG GCC CTG CTG CCA GCT GGG CAT GCA GGT 1200 
TGT GGC AGG ACT GGG TGA GGT GAA AAG CTG CAG GCG CTG CTG CCG GAC 1248 
ACG CTG CAT GGG CTA CCC TGT GTA GCT GCC GCC ACT AGG GGA GGG GGT 1296 
TTG TAG CTG TCG AGC TTG CCC CAT GGA TGA AGC TGT GTA GTG GTG CAG 1344 
GGA GTA CAC CCA CAG GCC AAC ACC CTT GCA GGA GAT GTC TTG CGT CGG 1392 
GAG GAG TGT TGG GCA GTG TAG ATG CTA TGA TTG TAT CTT AAT GCT GAA 1440 
GCC TTT AGG GGA GCG ACA CTT AGT GCT GGG CAG GCA ACG CCC TGC AAG 1488 
GTG CAG GCA CAA GCT AGG CTG GAC GAG GAC TCG GTG GCA GGC AGG TGA 1536 
AGA GGT GCG GGA GGG TGG TGC CAC ACC CAC TGG GCA AGA CCA TGC TGC 1584 
AAT GCT GGC GGT GTG GCA GTG AGA GCT GCG TGA TTA ACT GGG CTA TGG 1632 
ATT GTT TGA GCA GTC TCA CTT ATT CTT TGA TAT AGA TAC TGG TCA GGC 1680 
AGG TCA GGA GAG TGA GTA TGA ACA AGT TGA GAG GTG GTG CGC TGC CCC 1728 
TGC GCT TAT GAA GCT GTA ACA ATA AAG TGG TTC 1771 

(2> INFORMATION FOR SEQ ID N0:4: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 amino acids 

<B> TYPE: amino acid 

(C) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 



Met Gin Leu Ala Ala Thr Val Met Leu 
5 

Glu Gin Leu Thr Gly Ser Ala Glu Ala Leu Lys Glu Lys Glu Lys Glu 
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10 15 '20 25 

Val Ala Gly Ser Ser Asp Val Leu Arg Thr Trp Ala Thr Gin Tyr Ser 

30 35 40 

Leu Pro Ser Glu GLu Ser Asp Ala Ala Arg Pro Gly Leu Lys Asn Ala 

45 50 55 

Tyr Lys Pro Pro Pro Ser Asp Thr Lys Gly Ue Thr Met Ala Leu Arg 

60 65 70 

Val lie Gly Ser Trp Ala Ala Val Phe Leu His Ala lie Phe Gin He 

75 80 85 

Lys Leu Pro Thr ser Leu Asp Gin Leu His Trp Leu Pro Val Ser Asp 
90 95 100 105 

Ala Thr Ala Gin Leu Val Ser Gly Thr Ser Ser Leu Leu Asp lie Val 

no 115 120 

Val Val Phe Phe Val Leu Glu Phe Leu Tyr Thr Gly Leu Phe lie Thr 

125 130 135 

Thr His Asp Ala Met Kis Gly Thr J le Ala Met Arg Asn Arg Gin Leu 

140 145 150 

Asn Asp Phe Leu Gly Arg Val Cys lie Ser Leu Tyr Ala Trp Phe Asp 

155 160 165 

Tyr Asn Met Leu His Arg Lys His Trp Glu His His Asn His Thr Gly 
170 175 -180 185 

Glu Val Gly Lys Asp Pro Asp Phe His Arg Gly Asn Pro Gly lie Val 

190 195 200 

Pro Trp Phe Ala Ser Phe Met Ser Ser Tyr Met Ser Met Trp Gin Phe 

205 210 215 

Ala Arg Leu Ala Trp Trp Thr Val Val Met Gin Leu Leu Gly Ala Pro 

220 225 230 

Met Ala Asn Leu Leu Val Phe Met Ala Ala Ala Pro lie Leu Ser Ala 

235 240 245 

Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Met Pro His .Lys Pro Glu Pro 
250 255 260 265 

Gly Ala Ala Ser Gly Ser Ser Pro Ala Val Met Asn Trp Trp Lys Ser 

270 275 280 

Arg Thr Ser Gin Ala Ser Asp Leu Val Ser Phe Leu Thr Cys Tyr His 

285 290 295 

Phe Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala Pro Trp Trp 

300 305 310 

Glu Leu Pro Asn Cys Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala 
315 320 325 
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WHAT IS CLAIMED IS: 

1. A DNA segment comprising a nucleotide sequence coding for a 
polypeptide having an amino acid sequence as set forth in SEQ ID NO:4 

2. A DNA segment as in claim 1, wherein said nucleotide sequence is a 
variant selected from the group of variants consisting of allelic variants, species 
variants, naturally occurring variants, man-induced variants and combinations 
thereof 

3. A DNA segment as in claim 1, wherein said nucleotide sequence 
includes a sequence as set forth in SEQ ID NO: 1 . 

4. A DNA segment as in claim 1, wherein said nucleotide sequence 
includes a sequence as set forth between and including nucleotides 166 and 1 152 
ofSEQIDNO:l. 

5. A DNA segment comprising a nucleotide sequence hybridizing 

under high stringency conditions to a nucleic acid probe, said probe including at 
least fifteen successive nucleotides of SEQ ID NO:L 

6. A DNA segment comprising a nucleotide sequence hybridizing- 
under low-stringency conditions to a nucleic acid probe, said probe including 
nucleotides 166 through 11 52 of SEQ ID NO:l. 

7. A DNA segment comprising a nucleotide sequence hybridizing 
under high stringency conditions to a nucleic acid probe, said probe including at 
least fifteen successive nucleotides of SEQ ID NO:2. 

8. A DNA segment comprising a nucleotide sequence hybridizing 
under low-stringency conditions to a nucleic acid probe, said probe including 
nucleotides 1 66 through 1 1 52 of SEQ ID NO:2. 

9. An RNA segment comprising a nucleotide sequence coding for a 
polypeptide having an amino acid sequence as set forth in SEQ ID NO:4 

10. An RNA segment as in claim 9, wherein said nucleotide sequence is 
a variant selected from the group of variants consisting of allelic variants, species 
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variants, naturally occurring variants, man-induced variants and combinations 
thereof. 

1 1 . An RNA segment as in claim 9, wherein said nucleotide sequence 
includes a sequence as set forth in SEQ ID NO:2, 

12. An RNA segment as in claim 9, wherein said nucleotide sequence 
includes a sequence as set forth between and including nucleotides 166 and 1152 
ofSEQIDNO:2. 

13. An RNA segment comprising a nucleotide sequence hybridizing 
under high stringency conditions to a nucleic acid probe, said probe including at 
least fifteen successive nucleotides of SEQ ID NO: 1 . 

14. An RNA segment comprising a nucleotide sequence hybridizing 
under low-stringency conditions to a nucleic acid probe, said probe including 
nucleotides 166 through 1 152 of SEQ ID NO:L 

15. An RNA segment comprising a nucleotide sequence hybridizing 
under high stringency conditions to a nucleic acid probe, said probe inck\ding at 
least fifteen successive nucleotides of SEQ ID N0:2. 

16. An RNA segment comprising a nucleotide sequence hybridizing 
under low-stringency conditions to a nucleic acid probe, said probe including 
nucleotides 166 through 1 152 of SEQ ID NO:2. 

17. A polypeptide comprising an amino acid sequence corresponding to 
Haematococcus pluvialis crtO gene, allelic and species variants, and functional 
naturally occurring and man-induced variants thereof. 

18. A polypeptide as in claim 17, wherein said amino acid sequence is as 
set forth in SEQ ID NO:4. 

19. A polypeptide comprising an amino acid sequence homologous to 
the sequence set forth in SEQ ID NO:4. 

20. A polypeptide comprising an amino acid sequence being encoded by 
a DNA segment, said DNA segment hybridizing under low stringency conditions 
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to nucleotides 166' through 1 152 of SEQ ID NO:l, the polypeptide having a p-C-4- 
oxygenase activity. 

21. A recombinant vector DNA molecule comprising a DNA segment as 
in claim 2. 

22. A host comprising a recombinant vector DNA molecule as in claim 
21, said host is selected from the group consisting of a cell and an organism. 

23. A host comprising a DNA segment as in claim 2, said host is 
selected from the group consisting of a cell and an organism. 

24. A method of producing xanthophylls selected from the group 
consisting of astaxanthin, canthaxanthin, echinenone, cryptoxanthin, 
isocryptoxanthin, hydroxyechinenonc, zeaxanthin, adonirubin or adonixanthin and 
combinations thereof, comprising the steps of: 

(a) providing a host as in claim 22; 

(b) providing said host with growing conditions for production of the 

xanthophylls; and 

(c) extracting the xanthophylls from said host. 

25. A method of producing xanthophylls selected from the group 
coasisliiig of astaxanthia, canihaxanihin, echinenone, isocryptoxanthin, 
cryptoxanthin, hydroxyechinenonc, zeaxanthin, adonirubin or adonixanthin and 
combinations thereof, comprising the steps of: 

(a) providing a host as in claim 23; 

(b) providing said host with growing conditions for production of the 
xanthophylls; and 

(c) extracting the xanthophylls from said host. 

26. A host as in claim 22, wherein said host is used as a food additive. 



27. A host as in claim 23, wherein said host is used as a food additive. 



28. A transgenic plant expressing a transgene coding for a polypeptide 
including an amino acid sequence corresponding to Haematococcus pluvialis crtO 
gene, allelic and species variants or functional naturally occurring or man-induced 
variants thereof. 
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29. A transgenic plant as in claim 28, wherein said expression is highest 
in chromoplasts-containing tissues. 

30. A recombinant DNA vector comprising a first DNA segment 
encoding a polypeptide for directing a protein into plant chloroplasts or 
chromoplasts and an in frame second DNA segment encoding a polypeptide 
including an amino acid sequence corresponding to Haematococcus pluvialis crtO 
gene, allelic and species variants or functional naturally occurring and man- 
induced variants thereof. 

31. A recombinant DNA vector as in claim 30, wherein said first DNA 
segment is derived from the Pds gene of tomato. 

32. A recombinant DNA vector comprising a first DNA segment 
including a promoter highly expressible in plant chloroplasts or chromoplasts- 
containing tissues and a second DNA segment encoding a polypeptide including an 
amino acid sequence corresponding to Haematococcus pluvialis crtO gene, allelic 
and species variants or functional naturally occurring and man-induced variants 
thereof. 

33. A recombinant DNA vector as in claim 30, wherein said first DNA 
segment is derived from the Pds gene of tomato. 



wo 98/18910 



PCT/US97/17819 



Fig. 1 (prior art) 



1/11 



mevalonate — ► -AxCHjOPP ► A-^CHgOPP 



IPP 



IPP 



DMAPP 



IPP 



IPP 



CH2OPP 



CH2OPP 



GGPP 



GPP 



FPP 



CH2OPP GGPP 



H CH. 



PPPP 



-2H 



phytoene 



-2H 



phytofluene 



-2H 



i 



Q -carotene 



-2H 



1 



neurosporene 



lycopene 



• 



wo 98/18910 PCT/US97/17819 



2/11 



FIG. 2-A 



OITOA.SCQ 
CRTDJ.SCQ 

CRTOA. SCQ 
CXTOJ.SEQ 

CRTOA. seq 

CRTOJ.SCQ 

CRTOA. SEQ 
CRTOJ.SCQ 

CRTOA. S£Q 
CRTOJ.SCQ 

CRTOA, srq 

CRTOJ.ScQ 

CRTOA. S£Q 

CRTOJ.SlQ 

CRTOA. SEQ 
CRTOJ.SCQ 

CRTOA. SCQ 
CRTOJ.SCQ 

CRTOA. SEQ 
CRTOJ.SCQ 

CRTOA. SCQ 
CRTOJ.SCQ 

CRTOA. SCQ 
CRTOJ.SCQ 

CRTOA. SCQ 
CRTOJ.SCQ 



200 HQ 220 230 Z^O 230 

ACCACCTTAOrcCAACCCCTC acccactcaaccacaaccac - AACC ACCTTCCACCCACC 

. ... a. a. «... 

, il?!!! Z ••• .... aaaa 

ATCCAarrCCCATaXCACTAATCCTa:ACCACAAACCCACTCACCC«XTCCTn:C^ 
10 20 30 1*0 30 60 

260 270 • 2aO 290 300 310 

TCTCACC iC \ llXLTACATCCCCCACCCACTACnXC I ILCC rCACAACACTCACACCCC 

* ..... a.. a , ••.*•••• •««•• . a. a* .a. .a. a. a 

CCACACCICI lX;ACACCCTCCCa:ACAC«7rATCACATtXCATCCCAaCClCACACCCA 
70 to 90 100 no 120 

320 330 3^0 350 360 370 

CCCCCCCCCCCACTCAACAArCCCTACAACCCACCAC H IC LCACaCAAACCCCATCACA 

.f ** * • a aaaa a*. aa aaa % «• aaa • aaaaaataaaa 

aa a. aa a aa aa* « aaaaaaa* «• >aa a aa aaa a »0»»m»99m»m 

CCTOrrCCTCaxn'AAACOCCCCrACAAACCTCCACO.TC^ 

130 \U0 150 160 170 180 

350 390 ^rOO 610 620 630 

ATCCCCCTACCTCTCATaXCTCCTCCCCCCCAC 10 i i CCTCCACCCCA l 1 i i iC UATC 



ATCCCCCTCACCATCATTax:AC(7rCCACCCCAC IACaCCCAA TA mCA^VTC 

190 200 210 220 230 260 

660 650 660 670 630 690 

AACCnXrCAOCTCCrrCCACCACCVaUCTCCCTCCCCCTCT^ 

a .aa ..aaa aaa m»»m»m»»»* » • t a m m aaa. aaaa. a. a.aaaaaa a. a 

ACCCTAttCACATOIATCCACC^aTTC^^ 

230 260 270 280 290 300 

300 310 320 330 560 330 

CTCCTTACCCCCACCACCiUXCTCCTCCACATCCTC^^^ 

.* • ..aa a t*»»m»mn mm ...aaa a aaa aaa a.aa aa .aaaaa 

crrrrccaxcAaKCclccACCcrAaa^ 

310 320 330 360 350 360 

360 570 380 390 600 6i0 

CTCTACACACCCCi 1 1 1 rATCACCACCCATCATCCTATCCATCCCACCATCCCCATCACA 



CTCTTACACrCGTCTATTCATC^CCAC^CATCACCXAATCCATC^ 

370 330 390 600 6(0 620 

620 630 660 630 660 670 

AACACCCACCTTAATCA Ci ICl IC CCCACACTATCCAI C I CC i IC r A CCCCTCCTnCAT 



CACACCCACCTCAATCAT CiCCt tC CCAACATCTCCATATCACTCTACCC HtCn IC AC 
630 660 650 660 670 680 

680 690 700 710 720 730 

tacaacatcctccaccccuccattcccaccaccac\accacactccccacctccccaac 

aaaaaaaaa taaaaaaa aaaaaaaaaaaaaaaaa aaaaaaa. aaaa* aa 
•*•* aaaaaaaaa aaaaaaa* aaaaaaaaaaaaaaaaa ••mm*m»» aaaaa aa 

TACACCATCCTCC^TCCCAACCACTCCCACCACCACA^CCATACTCCCCAACTCCCCAAA 
690 5C0 310 320 330 360 

760 750 760 770 780 790 

CACCCTCACrra>CA(XCCA.AACCCTCCC\TTCTCCCCTCCTTTCCCACCnCATC^ 

Jj;;;;»Jj;;»j;jJ JJ*i;; J** •••• aaaa. a.. a.aaa.a.a.aaaa. 

CACcacAcrTcacA.\ccc;i>ATCccccccfrckccc^^ 

330 360 570 330 390 600 

800 810 820 830 860 830 

ACCTACATCTCCATCTCCCAGTrrCCCCCCCTCCCATCCTCCACCCTCCTCATCCACCTG 

...... .a. ..a. aa a. aaaaaaaaa a ...a. .aaaa .. 

...aa...... •.•aaa...aa.. a. Sa..SSSa! . i Z i I S .iaa. .. 

ACCTACATCtCCC7CTt;CCACrrTCCCCCCCTCCCATCCTCCCCACTCCTCATCCAAA 

610 620 630 660 630 660 

860 870 830 890 900 910 

CTCCCTC-CCCAATCCCCAACCTCCTCCTCTTCATCCCCCCCCCCCCCATCCTCTCCCCC 

• 5*5 aSssa iz i I la ss s.sissii s. r» mil :ssr 1 1 
CTCCCCCCCCCCATCCCAA*TCTCCTACTCrrCATCCCTCCACC^^ 

670 680 690 700 710 720 

920 930 960 930 960 970 

rrCCCCTTCTTCTACTTTCCCACCTACATCCCCCACAACCCTCACCCTCCCCCCCCCTCA 

'*■*•■ * S.SSaSSa SSaaS aaa SSlT .aaaaaaaaaaaaa aaa . aa aa 

rrcccccTcrTCTAcrrccccAcrTACcrcccAclc^ 

730 760 730 760 7 70 780 



wo 98/18910 



PCT/US97/17819 



3/11 



FIG. 2-B 



980 990 1000 lOlO 1020 1030 

CRTOA . ScQ CCCTCrrCACCACCCCTCATC AACTCCTCC AACTCCCCCACTACCCACCCCTCCC ACCrC 

CXTOJ.StQ CCCTC TCAC-<TCATCCCCrrCCnCA(XCCCA.XCACAACTCAC^ 

790 800 810 820 830 

10^0 1050 1060 1070 1080 1090 

C^TOA.StQ CTCACCTTTCTCACCTCa'ACCACTTCCACCTCCACTCCCACCACCACCCCTCXCCC^ 

CATO J . ScQ ATCACmCCTCACATCCTACCACmCACCTCCACTCCCACCACCACACCTCCC 
8^0 850 £60 870 880 890 

UOO nio 1120 U30 \\L0 1150 

CRT0A.Sc.q <^cccCTCxrrcccACCT(xccAACTCCcccca:crcTC7ca:cc^^ 

CRTOJ.SCQ CCCCCCTCGTCCCACCTCCCCCACrCCCCCC^^ 

900 910 920 930 9^0 950 

1 160 1 170 M30 M90 1200 

CR TCA . S ZQ T ACCTCCaCaCaCTCC ACTCCCCCCTCCTC - -CC aCC - TCCCCaTCCaCCTTC TCCCaCC 

CR'fOi.StQ TTC--CCATCACC7CCTCCCTCCCCrCCTCACCCACCCTCTC^ 

960 970 980 990 1000 lOlO 

1210 1220 
CRTOA. SCQ AUCCCTCACCT 

CRTOJ.SEQ ACACCCTCCTCA 
1020 



wo 98/18910 



PCT/US97/17819 



4/11 
FIG. 3 

1 10 20 30 60 50 

CRTOA.AMI HQUL\TVHLEQLTCSA£ALK£K£KEVAGSSDVLRTVATQlfSLPStESOAARPCLKKAY 

CRTOJ . AMI KHVASALKVEQK-CS-EA AASSPO'A-RAWAT^WMPSESSOAARPALKKAt' 

1 10 20 30 40 

60 70 80 90 100 110 

CRTOA.AMI JCPPPSDTKCITMALRVICSWAAVFLHAIFQIKLPTSLDQLHWLPVSDATAQLVSCTSSLL 

CRTOJ .AMI KPPASDAKCITMALTlicmAVFLHA^ 

50 60 70 80 90 100 

. 120 130 140 150 160 170 

CRTOA.AMI DIVWFFVLEFLYTCLFinHDAKHCTIAMRHRQLKDfLCRVCISLYAHFOYNMLKRKHW 

CRTOJ .AMI HlAAvhCiEFLYTCLFiTTHDAlW 

110 120 130 140 150 160 

180 190 200 210 220 230 

CRTOA.AMI EHHNHTCEVCig)PDFHRCNPCIVPWFASFMSSYMSWQFARLAWV.TVVKQLLCAPKANLL 

CRTOJ .AMI EHHNHTCEVCKDPDraCNPGLVTVFAS^^ 

170 180 ISO 200 210 220 

260 250 260 270 280 290 

CRTOA.AMI VFMAA\P ILSAFRLFYFCTYMPKKPEPCAASCSSPAVMNWKSRTSQASDLVSFLT 

CR.TOJ./jU VtHA.V'>?ILSArRLfYtCTYL?Ku?£PC?AAGSQ--VHVnTR.'JCT5£ASDV-hScLT 
230 240 250 260 270 280 



CRT0A..AMI 
CRTOJ. AMI 



300 310 320 329 

CYHFOLKWEHHRWPFAPMWELPNCRRLSCRCLVPA 



CYHFDLHWEHHRWPFAPWWQLPHCRRLSCRCLVPALA 
290 300 310 320 



wo 98/18910 PCTAJS97/17819 

6/11 



O 

d) 
Ll 



GC 

8 

UJ 



PQ 



CO 

o 
c 

< 

Q. 



O 
CD 
CM 



~ c <n • 



X 



CO 

O 

c 



CO 

O 

c 

< 

cx 



cn 

< 
cl 



cx 

o 
o 

CO 



CL 

o 
o 

CO 



CL 

o 
m 

CM 



Xi 

o 

CO 



wo 98/18910 



PCT/US97/17819 



7/11 



-J 

E mi 



CO 

UL 



to 
o 
c 

< 



X 



X 
E 



to 
o 
c 

CL 



Ql 



O 

c 

< 

cx 



cn 
< 

CL 



CL 

O 
CO 
Csl 



CL 
CO 



CL 

O 
O 
CO 



JD 

O 
O 
CO 



CL 
O 

to 

CL 
JQ 

O 
CO 



H 
a. 



wo 98/18910 



PCT/US97/17819 



8/11 



CM 

T — 

• MM 

LL 



p 









O 




•c 




< 


VI 

m -J 


Ct 



to 





o 








QC 




O 








Ql 






CO 




O 




c 




Q- 




Hid 












O 




c 




< 




CL 












< 




Q. 



o 

CO 



CL 



CL 
CO 



CL 

o 

o 

CO 



CL 

o 
o 

CO 



CL 

O 

to 

CM 

CL 
X> 

O 
CO 
CM 



O 
H 

O 
H 

1^ 



wo 98/18910 



PCT/US97/17819 



9/11 



Figure 13 



10 9 6 4 3 2 WT 




4 



4 



wo 98/18910 PCT/US97/17819 



10/11 





INTERNATIONAL SEARCH REPORT 


International application No. 
PCT/US97/17S19 


A. CLASSIFICATION OF SUBJECT MATTER 
IPC(6) : C12N 9/02, 1/20, 15/09, 15/63; C12P 13/00 
US CL : 435/132, 189, 252.3, 252.33, 419, 320.1; 536/23.2; 800/205 

According to International Patent Classification (IPC) or to both national classification and IPC 


B. FIELDS SEARCHED 


Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 435/132, 189, 252.3, 252.33, 419, 320.1; 536/23.2; 800/205 


Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 


Electronic data base consulted during the international search (name of data base and, where practicable, search tenns used) 
Please See Extra Sheet. 


C. DOCUMENTS CONSIDERED TO BE RELEVANT 


Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


X 
Y 


KAJIWARA et al. Isolation and functional identification of a novel 
cDNA for astaxanthin biosynthesis from Haemaiococcus pluvialis, 
and astaxanthin in Escherichia coli. Plant Molecular Biology. 
October 1995, Vol. 29, pages 343-352, see abstract. 


2, 6, 8, 10, 14, 

16, 17, 19-23 

24-33 


X 
Y 


LOT AN et al. Cloning and expression in Escherichia coli of the gene 
encoding p-C-4-oxygenase, that convert p-carotene to the 
ketocarotenoid canthaxanthin in Haefnatococcus pluvialis. FEBS 
Letters. 08 May 1995, Vol. 364, pages 125-128, see entire 
document. 


2, 6, 8, 10, 14, 
16, 17, and 19-23 


24-33 


1 x| Further documents are listed in the continuation of Box C. } } Sec patent family annex. 


* Special categones of cited documeoU: 

* A* document defuung (he general itate of the art which b not ccmsidered 

lo be of particular relevance 

'E* earlier doctment publiihed on or afler the inlomational filing date 

'L* document which may throw doubta on priority claim(a) or which ia 
cited to eatibliab the publication date of another citation or other 
apeeial reaaon (aa apocified) 

*0* dooument referring to an oral discloeure, use* exhibition or other 
meana 

*P* document published prior lo the international filing date but later than 
the priority date claimed 


"T* later document published after the tnteniatioiial rding date or priority 
data and not an conilict with the applicalion but cited lo underaland 
the principle or theory uiiderlying the inventkm 

*X* document of particular relevance; the claimed invention cannot t>e 
ooneidered novel or cannot be eonaidered to involve an inventive step 
when the doctiment ia taken alone 

*Y* document of particular relevance; the claimed invention cannot be 
eonaidered to involve an inventive atep when the document ia 
combined with one or more other euch documeoia, auch ccMnbinalioo 
being obvioua to a penon tkilled in the mst 

document member of (he aame patent family 


Date of the actual completion of the international search 
11 DECEMBER 1997 


Date of mailing of the international search report 

2 6 JAN \m 


Name and mailing address of the ISA/US 
Commtsaioner of Patents and Trademarks 
Box PCT 

Washington, D.C 20231 


Authorized officer 
NASHAAT T. 


WASHED 





INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/17819 



C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 


Category* 


Citation of document, vnth indication, where appropriate, of the relevant passages 


Relevant to claim No. 


X 


WO 96/06172 Al (KIRIN BEER KABUSHIKI KAISHA) 29 
February 1996, page 1, abstract. 


17 


Y 


2, 10, and 19-33 


X.P 


HARKER et al. Biosynthesis of ketocarotenoids in transgenic 
cyanobacteria expressing the algal gene for p-C-4-oxygenase, criO. 
FEBS Letters. 03 March 1997. Vol. 404, pages 129-134, see 
abstract. 


2, 6, 8, 21-25 


Y,P 


10. 14, 16. 17, 
19, 20 and 26-33 


A 


AUSICH, R. L. Production of carotenoids by recombinant DNA 
technology. Pure & Appl. Chem. 1994, Vol. 66, pages 1057-1062, 
abstract. 


24-25 


A 


MISAWA et al. Canthaxanthin biosynthesis by the conversion of 
methylene to keto groups in a hydrocarbon (J-carotene by a single 
gene. Biochem. Biophys. Res. Comm. 26 April 1995, Vol. 209, 
No. 3, pages 867-S76. 


2, 6. 8. 10, 14, 
16, and 19-33 


A 


DEMMIG-ADAMS, B. Carotenoids and photoprotection in plants: 
a role for the xanthophyll zeaxanthin. Biochem. Biophys. Acta. 
1990, Vol. 1020, pages 1-24. 


24-33 


A 


KRINSKY, N. I. Antioxidant function of carotenoids. Free Rad. 
Biol. Med. 1989, Vol. 7, pages 617-635. 


26 and 27 


A 


SANDMANN, G. Carotenoid biosynthesis in microorganisms and 
plants. Eur. J. Biochem. 1994, Vol. 223, pages 7-24. 


24 and 25 



Fomi PCT/ISA/210 (continuation of second sheet)(July 1992)«^ 



It 



INTERNATIONAL SEARCH REPORT 


International application No. 




PCT/US97/17819 



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This international report has not been established in respect of ceitain claims under Article 17(2)(a) for the following reasons: 
1. I I Claims Nos.: 

' because they relate to subject matter not required to be searched by this Authority, namely: 



2. Claims Nos.: 1, 3-5. 7, 9, 11-13, 15 and 18 

I— ' because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 

The claims are drawn to specific nucleic acid and amiiio acid sequences of SEQ ID NO: 2 and SEQ ID NO: 4; 
respectively. Applicants have not provided the sequences in computer readable form to search the sequences in 
commercial data bases. Thus, the claims could not be meaningfully searched. 

3. I I Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentenced of Rule 6.4(a). 



Box II Observations where unity of invention is lacking; (Continuation of Item 2 of Rrst sheet) 
This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Extra Sheet. 



1 . Fxi As all required additional search fees were timely paid by the applicant, this international search report covers all searchable 

claims. 

2. j I As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 

3. I I As only some of the required additional search fees were timely paid by the applicant, this international search report covers 

only those claims for which fees were paid, specifically claims Nos.: 



4. I I No required additional search fees were timely paid by the applicant. Consequently, this international search tcpoti is 
restricted to the invention first mentioned in the claims; it is covered by elaims Nos.: 



Remark on Protest | \ The additional search fees were accompanied by the applicant's protest. 

\ j No protest accompanied the payment of additional search fees. 



Form PCT/ISA/210 (continuation of first sheet(l))(July 1992)* 



9 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/17819 



B. RELDS SEARCHED 

Electronic data bases consulted (Name of data base and where practicable terms used): 

APS, STN: Medline, Caplus, Scisearch, Lifesci, Biosis, Embase, Wpids, Agricola, and Biotechds. Search terms: 
haematococcus, pluvialis, DNA, cDNA, Sequence, Astaxanthin, canthaxanthin, echinenone, isocryptoxanthin, 
crytoxanthin, hydroxyechinenone, zeaxanthin, adonirubin, adonixanthin and crtO. 

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

This application contains the following inventions or groups of inventions which arc not so linked as to form a single 
inventive concept under PCT Rule 13.1. In order for all inventions to be searched, the appropriate additional search 
fees must be paid. 

Group I, claims 2, 6, 8, 10, 14, 16, 17, and 19-27, drawn to DNA coding for an oxido-reductase which is involve in 
the biosynthesis of asUxanthin, the enzyme of SEQ ID NO: 4, vector, host cell and a method of making xanthophylls. 

Group II, claims 28-33, drawn to a transgenic plants, and a special vector for transfomning plants. 

The inventions listed as Groups 1 and II do not relate to a single inventive concept under PCT Rule 13,1 because, under 
PCT Rule 13,2, they lack the same or corresponding special technical features for the following reasons: The special 
technical feature of the invention of Group I is the DNA coding for the oxido-reductase of SEQ ID NO: 4. Group I 
encompass the DNA, the enzyme, vector containing the said DNA, host cell and a single use for the DNA which is the 

method of pmducing xanthophylls. The invention of Group 11 is drnwTi to a transgenic plant containing the DNA of 
Group I and vectors useful for transfonming plants. Thus, the invention of Group H represent a second use for the 
DNA. Thus, the inventions of Groups I and II arc not so linked v/iih a special technical feature as to- form a single 
inventive concept under PCT Rule 13.1, see 37 C.F.R. 1.475(d), 



Fonn PCT/ISA/210 (extra sheet)(July 1992)* 



